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(57) Abstract 



Cells are genetically modified to expresss a luminophore, e.g., a modified (F64L, S65T, Y66H) Green Fluorescent Protein (GFP, 
EGFP) coupled to a component of an intracellular signalling pathway such as a transcription factor, a cGMP- or cAMP-dependent protein 
kinase, a cyclin- calmodulin- or phospholipid-dependent or mitogen-activated serine/threonin protein kinase, a tyrosine protein kinase, or 
a protein phosphatase (e.g. PKA, PKC, Erk, Smad, VASP, actin. p38. Jnkl, PKG, IkappaB, CDK2, Grk5, Zap70, p85, protcin-tyrosinc 
phosphatase IC. Stat5. NFAT. NFkappaB, RhoA, PKB). An influence modulates the intracellular signalling pathway in such a way that 
the luminophore is being redistributed or translocated with the component in living cells in a manner experimentally determined to be 
correlated to the degree of the influence. Measurement of redistribution is peifonned by recording of light intensity, fluorescence lifetime, 
polarization, wavelength shift, resonance energy transfer, or other properties by an apparatus consisting of e.g. a fluorescence microscope 
and a CCD camera. Data stored as digital images are processed to numbers representing the degree of redistribution. The method can be 
used as a screening program for identifying a compound that modulates a component and is capable of treating a disease related to the 
function of the component. 
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A METHOD for extracting quantitative information relating to an Influence on a cellular re- 
sponse 



FIELD OF INVENTION 

5 The present invention relates to a method and tools for extracting quantitative information 
relating to an influence, on a cellular response, in particular an influence caused by contac- 
ting or incubating the cell with a substance Influencing a cellular response, where the cellular 
response is manifested in redistribution of at least one component in the cell. In particular, 
the invention relates to a method for extracting quantitative information relating to an influen- 

10 ce on an intracellular pathway Involving redistribution of at least one component associated 
with the pathway. The method of the Invention may be used as a very efficient procedure for 
testing or discovering the influence of a substance on a physiological process, for example In 
connection with screening for new drugs, testing of substances for toxicity, identifying drug 
targets for known or novel drugs? Other valuable uses of the method and technology of the 

15 Invention will be apparent to the skilled person on the basis of the following disclosure. In a 
particular embodiment of the Invention, the present invention relates to a method of detec- 
ting Intracellular translocation or redistribution of biologically active polypeptides, preferably 
an enzyme, affecting intracellular processes, and a DNA construct and a cell for use in the 
method. 

20 

BACKGROUND OF THE INVENTION 

Intracellular pathways are tightly regulated by a cascade of components that undergo 
modulation in a temporally and spatially characteristic manner. Several disease states can 
be attributed to altered activity of individual signalling components (i.e. protein kinases, pro- 
25 tein phosphatases, transcription factors). These components therefore render themselves as 
attractive targets for therapeutic intervention. 

Protein kinases and phosphatases are well described components of several Intracellular 
signalling pathways. The catalytic activity of protein kinases and phosphatases are assumed 
to play a role In virtually all regulatable cellular processes. Although the involvement of pro- 
30 tein kinases in cellular signalling and regulation have been subjected to extensive studies, 
detailed knowledge on e.g. the exact timing and spatial characteristics of signalling events Is 
often difficult to obtain due to lack of a convenient technology. 
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Novel ways of monitoring specific modulation of Intracellular pathways in intact, living cells is 
assumed to provide new opportunities in drug discovery, functional genomics, toxicology, 
patient monitoring etc. 

The spatial orchestration of protein kinase activity is likely to be essential for the high degree 
of specificity of individual protein kinases. The phosphorylation mediated by protein kinases 
is balanced by phosphatase activity. Also within the family of phosphatases translocation has 
been observed, e.g. translocation of PTP2C to membrane ruffles [(Cossette et a/. 1996)]. and 
likewise is likely to be indicative of phosphatase activity. 

Protein kinases often show a specific intracellular distribution before, during and after activa- 
tion. Monitoring the translocation processes and/or redistribution of individual protein kinases 
or subunits thereof is thus likely to be indicative of their functional activity. A connection be- 
tween translocation and catalytic activation has been shown for protein kinases like the dla- 
cyl glycerol (DAG)-dependent protein kinase C (PKC). the cAMP-dependent protein kinase 
(PKA) [(DeBernardi ef a/.1996)] and the mitogen-activated-protein kinase Erk-1 [(Sano et 
15 a/. 1995)]. 

Commonly used methods of detection of intracellular localisation/activity of protein kinases 
and phosphatases are immunoprecipitation. Western blotting and immunocytochemical de- 

tection. 

Taking the family of diacyl glycerol {DAG)-dependent protein kinase Cs (PKCs) as an exam- 
ple, it has been shown that individual PKC isofomis that are distributed among different tis- 
sues and cells have different activator requirements and undergo differential translocation in 
response to activation. Catalytically inactive DAG-dependent PKCs are generally distributed 
throughout the cytoplasm, whereas they upon activation translocate to become associated 
with different cellular components, e.g. plasma membrane [(Farese. 1992),(Fulop Jr et 
25 a/.1995)] nudeus [(Khalil ef a/.1992)]. cytoskeleton f(Blobe ef a/.1996)]. The translocation 
phenomenon being indicative of PKC activation has been monitored using different ap- 
proaches: a) immunocytochemistry where the localisation of individual isofomis can be de- 
tected after permeabilisation and fixation of the cells [(Khalil ef a/.1992)]; and b) tagging all 
DAG-dependent PKC isofomis with a fluorescently labelled phorbol myristate acetate (PMA) 
[(Godson ef a/. 1996)]; and c) chemical tagging PKC b1 with the fluorophore Cy3 [(Bastiaens 
& Jovin 1996)] and d) genetic tagging of PKCa ([Schmidt ef al. 1997J) and of PKCy and PKC 
^ ([Sakai ef al. 1996]). The first method does not provide dynamic information whereas the 
latter methods will. Tagging PKC with fluorescently labelled phorbol myristate acetate cannot 



20 
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distinguish between different DAG-dependent isoforms of PKC but will label and show 
movement of all isoforms. Chemical and genetic labelling of specific DAG-dependent PKCs 
confirmed that they in an isoform specific manner upon activation move to cell periphery or 
nucleus. 

5 In an alternative method, protein kinase A activity has been measured in living cells by 

chemical labelling one of the kinase's subunit (Adams et a/. 1991). The basis of the method- 
ology is that the regulatory and catalytic subunit of purified protein kinase A is labelled with 
fluorescein and rhodamine, respectively. At low cAMP levels protein kinase A is assembled 
in a heterotetrameric form which enables fluorescence resonance energy transfer between 

10 the two fluorescent dyes. Activation of protein kinase A leads to dissociation of the complex, 
thereby eliminating the energy transfer. A disadvantage of this technology is that the labelled 
protein kinase A has to be microinjected into the cells of interest. This highly invasive tech- 
nique is cumbersome and not applicable to large scale screening of biologically active sub- 
stances. A further disadvantage of this technique as compared to the presented invention is 

15 that the labelled protein kinase A cannot be inserted into organisms/animals as a transgene. 

Recently it was discovered that Green Fluorescent Protein (GFP) expressed in many differ- 
ent cell types, including mammalian cells, became highly fluorescent [(Chalfie et a/. 1994)]. 
WO95/07463 describes a cell capable of expressing GFP and a method for detecting a pro- 
tein of interest in a cell based on introducing into a cell a DNA molecule having DNA se- 

20 quence encoding the protein of interest linked to DNA sequence encoding a GFP such that 
the protein produced by the DNA molecule will have the protein of Interest fused to the GFP, 
then culturing the cells in conditions permitting expression of the fused protein and detecting 
the location of the fluorescence in the cell, thereby localizing the protein of interest in the 
cell. However, examples of such fused proteins are not provided, and the use of fusion pro- 

25 teins with GFP for detection or quantitation of translocation or redistribution of biologically 
active polypeptides affecting intracellular processes upon activation, such as proteins in- 
volved in signalling pathways, e.g. protein kinases or phosphatases, has not been sug- 
gested/wo 95/07463 further describes cells useful for the detection of molecules, such as 
hormones or heavy metals, in a biological sample, by operatively linking a regulatory ele- 

30 ment of the gene which is affected by the molecule of interest to a GFP, the presence of the 
molecules will affect the regulatory element which in turn will affect the expression of the 
GFP. In this way the gene encoding GFP is used as a reporter gene in a cell which is con- 
structed for monitoring the presence of a specific molecular identity. 
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Green Fluorescent Protein has been used in an assay for the detection of translocation of 
the glucocorticoid receptor (GR) (Carey, KL et al., The Journal of Cell Biology. Vol. 133 No 
5. p. 985-996 (1996)]. A GR-S65TGFP fusion has been used to study the mechanisms in- 
volved in translocation of the glucocorticoid receptor (GR) in response to the agonist dex- 
amethasone from the cytosol. where it is present in the absence of a ligand. through the nu- 
clear pore to the nucleus where it remains after ligand binding. The use of a GR-GFP fusion 
enables real-time imaging and quantitation of nuclear/cytoplasmic ratios of the fluorescence 
signal. 

Many currently used screening programmes designed to find compounds that affect protein 
kinase activity are based on measurements of kinase phosphorylation of artificial or natural 
substrates, receptor binding and/or reporter gene expression. 



DISCLOSURE OF THE INVENTION 

The present invention provides an important new dimension in the investigation of cellular 
systems involving redistribution In that the invention provides quantification of the redistribu- 
tion responses or events caused by an Influence, typically contact with a chemical substance 
or mixture of chemical substances, but also changes In the physical environment. The quan- 
tification makes it possible to set up meaningful relationships, expressed numerically, or as 
curves or graphs, between the Influences (or the degree of Influences) on cellular systems 
and the redistribution response. This is highly advantageous because, as has been found, 
the quantification can be achieved in both a fast and reproducible manner, and - what is per- 
haps even more important - the systems which become quantifiable utilizing the method of 
the invention are systems from which enormous amounts of new Information and insight can 
be derived. 

The present screening assays have the distinct advantage over other screening assays, 
e.g.. receptor binding assays, enzymatic assays, and reporter gene assays, in providing a 
system in which biologically active substances with completely novel modes of action, e.g. 
inhibition or promotion of redlstribution/translocation of a biologically active polypeptide as a 
way of regulafing its action rather than inhibition/activation of enzymatic activity, can be iden- 
tified in a way that insures very high selectivity to the particular isoform of the biologically ac- 
tive polypeptide and further development of compound selectivity versus other isoforms of 
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the same biologically active polypeptide or other components of the same signalling . pa- 
thway. 

In its broadest aspect, the invention relates to a method for extracting quantitative informati- 
on relating to an influence on a cellular response, the method comprising recording variation, 
5 caused by the influence on a mechanically intact living cell or mechanically intact living cells, 
in spatially distributed light emitted from a luminophore, the luminophore being present in the 
cell or cells and being capable of being redistributed in a manner which is related with the 
degree of the influence, and/or of being modulated by a component which is capable of be- 
ing redistributed in a manner which is related to the degree of the influence, the association 

10 resulting in a modulation of the luminescence characteristics of the luminophore. detecting 
and recording the spatially distributed light from the luminophore, and processing the recor- 
ded variation in the spatially distributed light to provide quantitative information correlating 
the spatial distribution or change in the spatial distribution to the degree of the influence. In a 
preferred embodiment of the invention the luminophore, which is present in the cell or cells, 

15 is capable of being redistributed by modulation of an intracellular pathway, in a manner 

which is related to the redistribution of at least one component of the intracellular pathway. In 
another preferred embodiment of the invention, the luminophore is a fluorophore. 



The cells 

20 In the invention the cell and/or cells are mechanically intact and alive throughout the experi- 
ment. In another embodiment of the invention, the cell or cells is/are fixed at a point in time 
after the application of the influence at which the response has been predetermined to be 
significant, and the recording is made at an arbitrary later time. 

The mechanically intact living cell or cells could be selected from the group consisting of 
25 fungal cell or cells, such as a yeast cell or cells; invertebrate cell or cells including insect cell 
or cells; and vertebrate ceil or cells, such as mammalian cell or cells. This cell or these cells 
is/are incubated at a temperature of SC^C or above, preferably at a temperature of from 32*'C 
to 39*C, more preferably at a temperature of from 35°C to 38**C, and most preferably at a 
temperature of about 37*C during the time period over which the influence is observed. In 
30 one aspect of the invention the mechanically intact living cell is part of a matrix of identical or 
non-identical cells. 
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A cell used in the present invention should contain a nucleic acid constmct encoding a fusion 
polypeptide as defined herein and be capable of expressing the sequence encoded by the 
construct. The cell is a eukaryotic cell selected from the group consisting of fungal ceils, 
such as yeast cells; invertebrate cells including insect cells; vertebrate cells such as mam- 
malian cells. The prefen-ed cells are mammalian cells. 

In another aspect of the invention the cells could be from an organism carrying in at least 
one of its component cells a nucleic acid sequence encoding a fusion polypeptide as defined 
herein and be capable of expressing said nucleic acid sequence. The organism is selected 
from the group consisting of unicellular and multicellular organisms, such as a mammal. 



The luminophore 

The luminophore is the component which allows the redistribution to be visualised and/or 
recorded by emitting light in a spatial distribution related to the degree of influence. In one 
embodiment of the invention, the luminophore is capable of being redistributed in a manner 
which is physiologically relevant to the degree of the influence. In another embodiment, the 
luminophore is capable of associating with a component which is capable of being redistri- 
buted in a manner which is physiologically relevant to the degree of the Influence. In another 
embodiment, the luminophore correlation between the redistribution of the luminophore and 
the degree of the influence could be detemiined experimentally. In a preferred aspect of the 
invention, the luminophore is capable of being redistributed in substantially the same manner 
as the at least one component of an intracellular pathway. In yet another embodiment of the 
invention, the luminophore is capable of being quenched upon spatial association with a 
component which is redistributed by modulation of the pathway, the quenching being measu- 
red as a change in the intensity of the luminescence. 

The luminophore could be a fluorophore. In a preferred embodiment of the invention, the lu- 
minophore could be a polypeptide encoded by and expressed from a nucleotide sequence 
harboured in the cell or cells. The luminophore could be a hybrid polypeptide comprising a 
fusion of at least a portion of each of two polypeptides one of which comprises a luminescent 
polypeptide and the other one of which comprises a biologically active polypeptide, as defi- 
ned herein. 

The luminescent polypeptide could be a GFP as defined herein or could be selected from the 
group consisting of green fluorescent proteins having the F64L mutation as defined herein 
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such as F64L-GFP. F64L-Y66H-GFP. F64L-S65T-GFP. and EGFP. The GFP could be N- or 
C-terminaily tagged, optionally via a peptide linker, to the biologically active polypeptide or a 
part or a subunit thereof. The fluorescent probe could be a component of a Intracellular sig- 
nalling pathway. The probe is coded for by a nucleic acid construct. 

5 The pathway of investigation in the present invention could be an intracellular signalling pa- 
thway. 

The influence 

In a preferred embodiment of the invention, the influence could be contact between the me- 
10 chanically intact living cell or the group of mechanically intact living cells with a chemical 
substance and/or incubation of the mechanically intact living cell or the group of mechani- 
cally intact living cells with a chemical substance. The influence will modulate the intracellular 
processes. In one aspect the modulation could be an activation of the intracellular proces- 
ses. In another aspect the modulation could be an deactivation of the intracellular processes. 
15 In yet another aspect, the influence could inhibit or promote the redistribution without directly 
affecting the metabolic activity of the component of the intracellular processes. 

In one embodiment the invention is used as a basis for a screening program, where the ef- 
fect of unknown influences such as a compound library, can be compared to influence of 
known reference compounds under standardised conditions. 

20 

The recording 

In addition to the intensity, there are several parameters of fluorescence or luminescence 
which can be modulated by the effect of the influence on the underlying cellular phenomena, 
and can therefore be used in the invention. Some examples are resonance energy transfer. 
25 fluorescence lifetime, polarisation, wavelength shift. Each of these methods requires a parti- 
cular kind of filter in the emission light path to select the component of the light desired and 
reject other components. The recording of property of light could be In the form of an ordered 
array of values such as a CCD array or a vacuum tube device such as a vidicon tube. 

In one embodiment of the invention, the spatially distributed light emitted by a luminophore 
30 could be detected by a change in the resonance energy transfer between the luminophore 
and another luminescent entity capable of delivering energy to the luminophore, each of 
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Which has been selected or engineered to become part of. bound to or associated with parti- 
cular components of the intracellular pathway. In this embodiment, either the luminophore or 
the luminescent entity capable of delivering energy to the luminophore undergoes redistribu- 
tion in response to an influence. The resonance energy transfer would be measured as a 
change in the intensity of emission from the luminophore. preferably sensed by a single 
channel photodetector which responds only to the average intensity of the luminophore in a 
non-spatially resolved fashion. 

In one embodiment of the invention, the recording of the spatially distributed light could be 
made at a single point in time after the application of the influence. In another embodiment, 
the recording could be made at two points in time, one point being before, and the other 
point being after the application of the influence. The result or variation is determined from 
the change in fluorescence compared to the fluorescence measured prior to the influence or 
modulation. In another embodiment of the invention, the recording could be performed at a 
series of points in time, in which the application of the influence occurs at some time after 
the first time point in the series of recordings, the recording being performed, e.g., with a 
predetermined time spacing of from 0.1 seconds to 1 hour, preferably from 1 to 60 seconds, 
more preferably from 1 to 30 seconds, in particular from 1 to 10 seconds, over a time span 
of from 1 second to 12 hours, such as from 10 seconds to 12 hours, e.g.. from 10 seconds to 
one hour, such as from 60 seconds to 30 minutes or 20 minutes. The result or variation is 
detemiined from the change in fluorescence over time. The result or variation could also be 
detenDined as a change in the spatial distribution of the fluorescence over time. 



Apparatus 

The recording of spatially distributed luminescence emitted from the luminophore is perfor- 
med by an apparatus for measuring the distribution of fluorescence in the cell or cells, and 
thereby any change in the distribution of fluorescence in the cell or cells, which includes at a 
minimum the following component parts: (a) a light source, (b) a method for selecting the 
wavelength(s) of light from the source which will excite the fluorescence of the protein, (c) a 
device which can rapidly block or pass the excitation light into the rest of the system, (d) a 
series of optical elements for conveying the excitation light to the specimen, collecting the 
emitted fluorescence in a spatially resolved fashion, and fomiing an image from this fluore- 
scence emission, (e) a bench or stand which holds the container of the cells being measured 
in a predetermined geometry with respect to the series of optical elements. (0 a detector to 
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record the spatially resolved fluorescence in the form of an image, (g) a computer or electro- 
nic system and associated software to acquire and store the recorded images, and to com- 
pute the degree of redistribution from the recorded images. 

In a preferred embodiment of the invention the apparatus system is automated. In one em- 
5 bodiment the components in d and e mentioned above comprise a fluorescence microscope. 
In one embodiment the component in f mentioned above is a CCD camera. 

In one embodiment the image is formed and recorded by an optical scanning system. 

In one embodiment a liquid addition system is used to add a known or unknown compound 
to any or all of the ceils in the cell holder at a time determined in advance. Preferably, the 
10 liquid addition system is under the control of the computer or electronic system. Such an 

automated system can be used for a screening program due to its ability to generate results 
from a larger number of test compounds than a human operator could generate using the 
apparatus in a manual fashion. 



15 Quantitation of the influence 

The recording of the variation or result with respect to light emitted from the luminophore is 
performed by recording the spatially distributed light as one or more digital images, and the 
processing of the recorded variation to reduce it to one or more numbers representative of 
the degree of redistribution comprises a digital image processing procedure or combination 

20 of digital image processing procedures. The quantitative information which is indicative of the 
degree of the cellular response to the influence or the result of the influence on the intracel- 
lular pathway is extracted from the recording or recordings according to a predetermined ca- 
libration based on responses or results, recorded in the same manner, to known degrees of 
a relevant specific influence. This calibration procedure is developed according to principles 

25 described below (Developing an Image-based Assay Technique). Specific descriptions of the 
procedures for particular assays are given in the examples. 

While the stepwise procedure necessary to reduce the image or images to the value repre- 
sentative of the is particular to each assay, the individual steps are generally well-known 
methods of image processing. Some examples of the individual steps are point operations 
30 such as subtraction, ratioing, and thresholding, digital filtering methods such as smoothing, 
sharpening, and edge detection, spatial frequency methods such as Fourier filtering, image 
cross-correlation and Image autocorrelation, object finding and classification (blob analysis). 
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and colour space manipulations for visualisation. In addition to the algorithmic procedures, 
heuristic methods such as neural networks may also be used. 



Nucleic acid constructs 

The nucleic acid constructs used in the present invention encode in their nucleic acid sequ- 
ences fusion polypeptides comprising a biologically active polypeptide that is a component of 
an intracellular signalling pathway, or a part thereof, and a GFP, preferably an F64L mutant 
of GFP, N- or C-termlnally fused, optionally via a peptide linl^er, to the biologically active po- 
lypeptide or part thereof. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein kinase or a phosphatase. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a transcription factor or a part thereof which changes cellular localisation upon activation, 
in one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein, or a part thereof, which is associated with the cytoskeletal network and which 
changes cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein kinase or a part thereof which changes cellular localisation upon activation. 
In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a serine/threonine protein kinase or a part thereof capable of changing intracellular locali- 
sation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a tyrosine protein kinase or a part thereof capable of changing intracellular localisation 
upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a phospholipid-dependent serine/threonine protein kinase or a part thereof capable of 
changing intracellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a cAMP-dependent protein kinase or a part thereof capable of changing cellular localisati- 
on upon activation. In a preferred embodiment the biologically active polypeptide encoded by 
the nucleic acid construct is a PKAc-F64L-S65T-GFP fusion. 
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In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a cGMP-dependent protein kinase or a part thereof capable of changing cellular localisati- 
on upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
5 is a calmodulin-dependent serine/threonine protein kinase or a part thereof capable of chan- 
ging cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a mitogen-activated serine/threonine protein kinase or a part thereof capable of changing 
cellular localisation upon activation. In preferred embodiments the biologically active poly- 
10 peptide encoded by the nucleic acid constructs are an ERK1-F64L-S65T-GFP fusion or an 
EGFP-ERK1 fusion. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a cyclin-dependent serine/threonine protein kinase or a part thereof capable of changing 
cellular localisation upon activation. 

15 In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein phosphatase or a part thereof capable of changing cellular localisation upon acti- 
vation. 

In one preferred embodiment of the invention the nucleic acid constructs may be DNA con- 
structs. 

20 In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
In one embodiment the gene encoding GFP in the nucleic acid construct is derived from Ae- 
quorea victoria. In a preferred embodiment the gene encoding GFP in the nucleic acid con- 
struct is EGFP or a GFP variant selected from F64L-GFP. F64L-Y66H-GFP and F64L-S65T- 
GFP. 

25 In preferred embodiments of the invention the DNA constructs which can be identified by any 
of the DNA sequences shown in SEQ ID NO: 38, 40, 42, 44. 46, 48. 50, 52. 54, 56, 58, 60. 
62, 64. 66. 68. 70. 72. 74. 76. 78. 108. 110. 112, 114, 116, 118. 120, 122. 124. 126, 128, 
130, 132. 134. 136, 138, 140. 142 or are variants of these sequences capable of encoding 
the same fusion polypeptide or a fusion polypeptide which is biologically equivalent thereto, 

30 e.g. an isoform, or a splice variant or a homologue from another species. 
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Screening program 

The present invention describes a method that may be used to establish a screening pro- 
gram for the identification of biologically active substances that directly or indirectly affects 
intracellular signalling pathways and because of this property are potentially useful as me- 
dicaments. Based on measurements in living cells of the redistribution of spatially resolved 
luminescence firom luminophores which undergo a change in distribution upon activation or 
deactivation of an intracellular signalling pathway the result of the individual measurement of 
each substance being screened indicates its potential biological activity. 

In one embodiment of the invention the screening program is used for the identification of a 
biologically toxic substance as defined herein that exerts its toxic effect by interfering with an 
intracellular signalling pathway. Based on measurements in living cells of the redistribution of 
spatially resolved luminescence from luminophores which undergo a change in distribution 
upon activation or deactivation of an intracellular signalling pathway the result of the individ- 
ual measurement of each substance being screened indicates its potential biologically toxic 
activity. In one embodiment of a screening program a compound that modulates a compo- 
nent of an intracellular pathway as defined herein, can be found and the therapeutic amount 
of the compound estimated by a method according to the method of the invention. In a pre- 
fen-ed embodiment the present invention leads to the discovery of a new way of treating a 
condition or disease related to the intracellular function of a biologically active polypeptide 
comprising administration to a patient suffering from said condition or disease of an effective 
amount of a compound which has been discovered by any method according to the inven- 
tion. In another preferred embodiment of the invention a method is established for identifica- 
tion of a new drug target or several new drug targets among the group of biologically active 
polypeptides which are components of intracellular signalling pathways. 

In another embodiment of the invention an individual treatment regimen is established for the 
selective treatment of a selected patient suffering from an ailment where the available me- 
dicaments used for treatment of the ailment are tested on a relevant primary cell or cells 
obtained from said patient from one or several tissues, using a method comprising trans- 
fecting the cell or cells with at least one DNA sequence encoding a fluorescent probe ac- 
cording to the invention, transferring the transfected cell or cells back the said patient, or 
culturing the cell or cells under conditions permitting the expression of said probes and ex- 
posing it to an array of the available medicaments, then comparing changes in fluorescence 
patterns or redistribution patterns of the fluorescent probes in the intact living cell or cells to 
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detect the cellular response to the specific medicaments (obtaining a cellular action profile), 
then selecting one or more medicament or medicaments based on the desired activity and 
acceptable level of side effects and administering an effective amount of these medicaments 
to the selected patient. 

5 

Back-tracking of a signal transduction pathway 

The present invention describes a method that may be used to establish a screening pro- 
gram for back-tracking signal transduction pathways as defined herein. In one embodiment 
the screening program is used to establish more precisely at which level one or several 
10 compounds affect a specific signal transduction pathway by successively or in parallel test- 
ing the influence of the compound or compounds on the redistribution of spatially resolved 
luminescence from several of the luminophores which undergo a change in distribution upon 
activation or deactivation of the intracellular signalling pathway under study. 

15 Construction and testing of probes 

In general, a probe, i.e. a "GeneX"-GFP fusion or a GFP-"GeneX" fusion, is constructed 
using PGR with "GeneX"-speclfic primers followed by a cloning step to fuse "GeneX" in 
frame with GFP. The fusion may contain a short vector derived sequence between "GeneX" 
and GFP (e.g. part of a multiple cloning site region in the plasmid) resulting in a peptide lin- 
20 ker between "GeneX" and GFP in the resulting fusion protein. 

Petgjied stepwise progedurg; 

- Identifying the sequence of the gene. This is most readily done by searching a depository 
of genetic information, e.g. the GenBank Sequence Database, which is widely available and 

25 routinely used by molecular biologists. In the specific examples below the GenBank Acces- 
sion number of the gene in question is provided. 

- Design of gene-specific primers. Inspection of the sequence of the gene allows design of 
gene-specific primers to be used in a PGR reaction. Typically, the top-strand primer en- 

30 compasses the ATG start codon of the gene and the following ca. 20 nucleotides, while the 
bottom-strand primer encompasses the stop codon and the ca. 20 preceding nucleotides, if 
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the gene is to be fused behind GFP. i.e. a GFP-"GeneX" fusion. If the gene is to be fused in 
front of GFP, i.e. a "GeneX"-GFP fusion, a stop codon must be avoided. Optionally, the full 
length sequence of GeneX may not be used in the fusion, but merely the part which localizes 
and redistributes like GeneX in response to a signal. 



In addition to gene-specific sequences, the primers contain at least one recognition sequen- 
ce for a restriction enzyme, to allow subsequent cloning of the PGR product. The sites are 
chosen so that they are unique in the PGR product and compatible with sites in the cloning 
vector Furthermore, it may be necessary to include an exact number of nucleotides between 
the restriction enzyme site and the gene-specific sequence in order to establish the correct 
reading frame of the fusion gene and/or a translation initiation consensus sequence. Lastly, 
the primers always contain a few nucleotides in front of the restriction enzyme site to allow 
efficient digestion with the enzyme. 



-Identifying a source of the gene to be amplified. In order for a PGR reaction to produce a 
product with gene-specific primers, the gene-sequence must initially be present in the reacti- 
on, e.g. in the form of cDNA. Infomiation in GenBank or the scientific literature will usually 
Indicate In which tissue(s) the gene is expressed, and cDNA libraries from a great variety of 
tissues or cell types from various species are commercially available, e.g. from Clontech 
(Palo Alto), Stratagene (La Jolla) and Invitrogen (San Diego). Many genes are also available 
in cloned form from The American Type Tissue Collection (Virginia). 



- Optimizing the PGR reaction. Several factors are known to Influence the efficiency and 
specificity of a PGR reaction, including the annealing temperature of the primers, the con- 
centration of ions, notably Mg^* and K^ present in the reaction, as well as pH of the reaction. 
If the result of a PGR reaction Is deemed unsatisfactory, it might be because the parameters 
mentioned above are not optimal. Various annealing temperatures should be tested, e.g. in a 
PGR machine with a built-in temperature gradient, available from e.g. Stratagene (La Jolla). 
and/or various buffer compositions should be tried, e.g. the OptiPrime buffer system from 
Stratagene (La Jolla). 
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- Cloning the PGR product. The vector into which the amplified gene product will be cloned 
and fused with GFP will already have been taken into consideration when the primers were 
designed. When choosing a vector, one should at least consider in which cell types the pro- 
be subsequently will be expressed, so that the promoter controlling expression of the probe 

5 is compatible with the cells. Most expression vectors also contain one or more selective 
markers, e.g. conferring resistance to a drug, which is a useful feature when one wants to 
make stable transfectants. The selective marker should also be compatible with the cells to 
be used. 

10 The actual cloning of the PGR product should present no difficulty as it typically will be a 
one-step cloning of a fragment digested with two different restriction enzymes into a vector 
digested with the same two enzymes. If the cloning proves to be problematic, it may be be- 
cause the restriction enzymes did not work well with the PGR fragment. In this case one 
could add longer extensions to the end of the primers to overcome a possible difficulty of di- 

15 gestion close to a fragment end, or one could introduce an Intermediate cloning step not ba- 
sed on restriction enzyme digestion. Several companies offer systems for this approach, e.g. 
Invitrogen (San Diego) and Clontech (Palo Alto). 

Once the gene has been cloned and, in the process, fused with the GFP gene, the resulting 
20 product, usually a plasmid, should be carefully checked to make sure it is as expected. The 
most exact test would be to obtain the nucleotide sequence of the fusion-gene. 

Testing the probe 

Once a DNA construct for a probe has been generated, its functionality and usefulness may 
25 be tested by subjecting it to the following tests: 

- Transfecting it into cells capable of expressing the probe. The fluorescence of the cell Is 
inspected soon after, typically the next day. At this point, two features of cellular fluorescerv 
ce are noted: the intensity and the sub-cellular localization. 

30 
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The intensity should usually be at least as strong as that of unfused GFP in the cells. If it h 
not. the sequence or quality of the probe-DNA might be faulty, and should be carefully 
checked. 



The sub-cellular localization is an indication of whether the probe is likely to perform well. If it 
localizes as expected for the gene in question, e.g. is excluded from the nucleus, it can Im- 
mediately go on to a functional test. If the probe is not localized soon after the transfection 
procedure, it may be because of overexpression at this point in time, as the cell typically will 
have taken of very many copies of the plasmid, and localization will occur In time. e.g. within 
a few weeks, as piasmid copy number and expression level decreases. If localization does 
not occur after prolonged time, it may be because the fusion to GFP has destroyed a locali- 
zation function, e.g. masked a protein sequence essential for interaction with its normal cel- 
lular anchor-protein. In this case the opposite fusion might work. e.g. if GeneX-GFP does not 
work, GFP-GeneX might, as two different parts of GeneX will be affected by the proximity to 
GFP. If this does not work, the proximity of GFP at either end might be a problem, and it 
could be attempted to increase the distance by incorporating a longer linker between GeneX 
and GFP in the DNA construct. 

If there is no prior knowledge of localization, and no localization is observed, it may be be- 
cause the probe should not be localized at this point, because such is the nature of the pro- 
tein fused to GFP. It should then be subjected to a functional test. 

In a functional test, the cells expressing the probe are treated with at least one compound 
known to perturb, usually by activating, the signalling pathway on which the probe is expec- 
ted to report by redistributing itself within the cell. If the redistribution is as expected, e.g. if 
prior knowledge tell that it should translocate from location X to location Y. it has passed the 
first critical test. In this case it can go on to further characterization and quantification of the 
response. 

If it does not perform as expected, it may be because the cell lacks at least one component 
of the signalling pathway, e.g. a cell surface receptor, or there is species incompatibility, e.g. 
if the probe is modelled on sequence information of a human geneproduct, and the cell is of 
hamster origin. In both instances one should identify other cell types for the testing process 
where these potential problems would not apply. 
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If there is no prior knowledge about the pattern of redistribution, the analysis of the redistri- 
bution will have to be done in greater depth to identify what the essential and indicative fea- 
tures are. and when this is clear, it can go on to further characterization and quantification of 
the response. If no feature of redistribution can be identified, the problem might be as menti- 
5 oned above, and the probe should be retested under more optimal cellular conditions. 

If the probe does not perform under optimal cellular conditions it's back to the drawing board. 



Developing an image-based assay technique 

The process of developing an image-based redistribution assay begins with either the un- 
10 planned experimental observation that a redistribution phenomenon can be visualised, or the 
design of a probe specifically to follow a redistribution phenomenon already known to occur. 
In either event, the first and best exploratory technique is for a trained scientist or technician 
to observe the phenomenon. Even with the rapid advances in computing technology, the 
human eye-brain combination is still the most powerful pattern recognition system known, 
15 and requires no advance knowledge of the system in order to detect potentially interesting 
and useful patterns in raw data. This is especially if those data are presented in the form of 
images, which are the natural ''data type" for human visual processing. Because human vi- 
sual processing operates most effectively in a relatively narrow frequency range, i.e., we 
cannot see either very fast or very slow changes in our visual field, it may be necessary to 
20 record the data and play it back with either time dilation or time compression. 

Some luminescence phenomena cannot be seen directly by the human eye. Examples in- 
clude polarization and fluorescence lifetime. However, with suitable filters or detectors, these 
signals can be recorded as images or sequences of images and displayed to the human in 
the fashion just described. In this way, patterns can be detected and the same methods can 
25 be applied. 

Once the redistribition has been determined to be a reproducible phenomenon, one or more 
data sets are generated for the purpose of developing a procedure for extracting the quanti- 
tative information from the data. In parallel, the biological and optical conditions are determi- 
ned which will give the best quality raw data for the assay. This can become an iterative pro- 
30 cess; it may be necessary to develop a quantitative procedure in order to assess the effect 
on the assay of manipulating the assay conditions. 
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The data sets are examined by a person or persons with knowledge of the biological phe- 
nomenon and skill in the application of image processing techniques. The goal of this exerci- 
se is to determine or at least propose a method which will reduce the image or sequence of 
images constituting the record of a "response" to a value corresponding to the degree of the 
response. Using either interactive image processing software or an image processing tool- 
box and a programming language, the method is encoded as a procedure or algorithm which 
takes the image or images as input and generates the degree of response (in any units) as 
its output. Some of the criteria for evaluating the validity of a particular procedure are: 

• Does the degree of the response vary in a biologically significant fashion, i.e., does it 
show the known or putative dependence on the concentration of the stimulating 
agent or condition? 

• Is the degree of response reproducible, i.e., does the same concentration or level of 
stimulating agent or condition give the same response with an acceptable variance? 

• Is the dynamic range of the response sufficient for the purpose of the assay? If not, 
can a change in the procedure or one of its parameters Improve the dynamic range? 

• Does the procedure exhibit any clear "pathologies", i.e., does it give ridiculous values 
for the response if there are commonly occurring imperfections in the imaging pro- 
cess? Can these pathologies be eliminated, controlled, or accounted for? 

• Can the procedure deal with the normal variation in the number and/or size of cells in 
an image? 

In some cases the method may be obvious; in others, a number of possible procedures may 
suggest themselves. Even if one method appears cleariy superior to others, optimisation of 
parameters may be required. The various procedures are applied to the data set and the 
criteria suggested above are determined, or the single procedure is applied repeatedly with 
adjustment of the parameter or parameters until the most satisfactory combination of signal, 
noise, range, etc. are arrived at. This is equivalent to the calibration of any type of single- 
channel sensor. 

The number of ways of extracting a single value from an image are extremely large, and 
thus an intelligent approach must be taken to the initial step of reducing this number to a 
small, finite number of possible procedures. This is not to say that the procedure arrived at is 
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necessarily the best procedure - but a global search for the best procedure is simply out of 
the question due to the sheer number of possibilities involved. 

Image-based assays are no different than other assay techniques in that their usefulness is 
characterised by parameters such as the specificity for the desired component of the 
5 sample, the dynamic range, the variance, the sensitivity, the concentration range over which 
the assay will work, and other such parameters. While it is not necessary to characterise e- 
ach and every one of these before using the assay, they represent the only way to compare 
one assay with another. 



10 Example: Developing a Quantitati ve assay for GLUT4 Translocation 

GLUT4 is a member of the class of glucose transporter molecules which are important in 
cellular glucose uptake. It is known to translocate to the plasma membrane under some 
conditions of stimulation of glucose uptake. The ability to visualize the glucose uptake re- 
sponse noninvasively. without actually measuring glucose uptake, would be a very useful 
15 assay for anyone looking for. for example, treatments for type 11 diabetes. 

A CHO cell line which stably expressed the human insulin receptor was used as the basis for 
a new cell line which stably expressed a fusion between GLUT4 and GFP. This cell line was 
expected to show translocation of GLUT4 to the plasma membrane as visualized by the 
movement of the GFP. The translocation could definitely be seen in the form of the appea- 
20 ranee of local increases in the fluorescence in regions of the plasma membrane which had a 
characteristic shape or pattern. This is shown in Figure 12. 

These objects became known as "snircles", and the phenomenon of their appearance as 
"snircling". In order to quantitate their appearance, a method had to be found to isolate them 
as objects in the image field, and then enumerate them, measure their area, or determine 

25 some parameter about them which correlated in a dose-dependent fashion with the concen- 
tration of insulin to which the cells had been exposed. In order to separate the snircles, a bi- 
narizatlon procedure was applied in which one copy of the image smoothed with a relatively 
severe gaussian kernel (sigma = 2.5) was subtracted from another copy to which only a re- 
latively light gaussian smooth had been applied (sigma=0.5). The resultant image was re- 

30 scaled to Its min/max range, and an automatic threshold was applied to divide the image into 
two levels. The thresholded image contains a background of one value all found object with 
another value. The found objects were first filtered through a filter to remove objects far too 
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large and far too small to be snircles. The remaining objects, which represent snircles and 
other artifacts from the image with approximately the same size and intensity characteristics 
as snircles. are passed into a classification procedure which has been previously trained with 
many images of snircles to recognize snircles and exclude the other artifacts. The result of 
5 this procedure is a binary image which shows only the found snircles to the degree to which 
the classification procedure can accurately identify them. The total area of the snircles is 
then summed and this value is the quantitative measure of the degree of snircling for that 
image. 

10 Definitions: 

In the present specification and claims, the term "an influence" covers any influence to which 
the cellular response comprises a redistribution. Thus, e.g., heating, cooling, high pressure, 
low pressure, humidifying, or drying are influences on the cellular response on which the re- 
sulting redistribution can be quantified, but as mentioned above, perhaps the most important 
15 Influences are the influences of contacting or incubating the cell or cells with substances 

which are known or suspected to exert and influence on the cellular response involving a re- 
distribution contribution. In another embodiment of the invention the influence could be sub- 
stances from a compound drug library. 

20 In the present context, the term "green fluorescent protein" is intended to indicate a protein 
which, when expressed by a cell, emits fluorescence upon exposure to light of the correct 
excitation wavelength (cf. [(Chalfie et a/. 1994)]). In the following. GFP in which one or more 
amino acids have been substituted, inserted or deleted is most often termed "modified GFP". 
"GFP" as used herein includes wild-type GFP derived from the jelly fish Aequorea victoria 

25 and modifications of GFP. such as the blue fluorescent variant of GFP disclosed by Helm et 
al. (1994). Proc.Natl.Acad.Sci. 91:12501. and other modifications that change the spectral 
properties of the GFP fluorescence, or modifications that exhibit increased fluorescence 
when expressed in ceils at a temperature above about 30*^0 described in PCT/DK96/00051. 
published as WO 97/1 1094 on 27 March 1997 and hereby incorporated by reference, and 

30 which comprises a fluorescent protein derived from Aequorea Green Fluorescent Protein 

(GFP) or any functional analogue thereof, wherein the amino add in position 1 upstream from 
the chromophore has been mutated to provide an increase of fluorescence intensity when the 
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fluorescent protein of the invention Is expressed in cells. Preferred GFP variants are F64L- 
GFP, F64L-Y66H-GFP and F64L-S65T-GFP. An especially preferred variant of GFP for use 
in all the aspects of this invention is EGFP (DNA encoding EGFP which is a F64L-S65T va- 
riant with codons optimized for expression in nnammatian cells is available from Clontech, 
5 Palo Alto, plasmids containing the EGFP DNA sequence, cf. GenBank Acc. Nos. U55762, 
U55763). 

The term "intracellular signalling pathway" and ''signal transduction pathway" are intended to 
indicate the coordinated intracellular processes whereby a living cell transduce an extemal 
10 or internal signal into cellular responses. Said signal transduction will Involve an enzymatic 
reaction said enzymes include but are not limited to protein kinases, GTPases, ATPases. 
protein phosphatases, phospholipases. The cellular responses include but are not limited to 
gene transcription, secretion, proliferation, mechanical activity, metabolic activity, cell death. 

15 The term "second messenger" is used to indicate a low molecular weight component invol- 
ved in the early events of intracellular signal transduction pathways. 

The term "luminophore" is used to indicate a chemical substance which has the property of 
emitting light either inherently or upon stimulation with chemical or physical means. This in- 
cludes but is not limited to fluorescence, bioluminescence. phosphorescence, chemilumine- 
20 scence. 

The term "mechanically intact living cell" is used to indicate a cell which is considered living 
according to standard criteria for that particular type of cell such as maintenance of normal 
membrane potential, energy metabolism, proliferative capability, and has not experienced 
25 any physically invasive treatment designed to introduce extemal substances into the cell 
such as microinjection. 

The term "physiologically relevant" ,when applied to an experimentally determined redistribu- 
tion of an intracellular component, as measured by a change in the luminescence properties 
30 or distribution, is used to indicate that said redistribution can be explained in terms of the un- 
derlying biological phenomenon which gives rise to the redistribution. 
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Th terms "image processing" and "image analysis" are used to describe a large family of di- 
gital data analysis techniques or combination of such techniques which reduce ordered ar- 
rays of numbers (images) to quantitative information describing those ordered arrays of 
numbers. When said ordered arrays of numbers represent measured values from a physical 
process, the quantitative infonmation derived is therefore a measure of the physical process. 



The term "fluorescent probe" is used to indicate a fluorescent fusion polypeptide comprising 
a GFP or any functional part thereof which is N- or C-terminally fused to a biologically active 
polypeptide as defined herein, optionally via a peptide linl<er consisting of one or more amino 
acid residues, where the size of the linker peptide in itself is not critical as long as the desi- 
red functionality of the fluorescent probe is maintained. A fluorescent probe according to the 
invention is expressed in a cell and basically mimics the physiological behaviour of the biolo- 
gically active polypeptide moiety of the fusion polypeptide. 

The term 'mammalian cell" is intended to indicate any living cell of mammalian origin. The 
cell may be an established cell line, many of which are available from The American Type 
Culture Collection (ATCC. Virginia. USA) or a primary cell with a limited life span derived 
from a mammalian tissue, including tissues derived from a transgenic animal, or a newly 
established immortal cell line derived froma mammalian tissue including transgenic tissues, 
or a hybrid cell or cell line derived by fusing different celltypes of mammalian origin e.g. hy- 
bridoma cell lines. The cells may optionally express one or more non-native gene products, 
e.g. receptors, enzymes, enzyme substrates, prior to or in addition to the fluorescent probe. 
Preferred cell lines include but are not limited to those of fibroblast origin, e.g. BHK, CHO, 
BALB. or of endothelial origin, e.g. HUVEC. BAE (bovine artery endothelial), CPAE (cow 
pulmonary artery endothelial) or of pancreatic origin, e.g. RIN, INS-1 , MIN6, bTC3. aTC6. 
bTC6, HIT. or of hematopoietic origin, e.g. adipocyte origin, e.g. 3T3-L1 . neuro- 
nal/neuroendocrine origin. e.g. AtT20. PC12, GH3, muscle origin, e.g. SKMC. A10, C2C12, 
renal origin, e.g. HEK 293. LLC-PK1 . 

The term "hybrid polypeptide" is intended to indicate a polypeptide which is a fusion of at le- 
ast a portion of each of two proteins, in this case at least a portion of the green fluorescent 
protein, and at least a portion of a catalytic and/or regulatory domain of a protein kinase. 
Furthermore a hybrid polypeptide is intended to indicate a fusion polypeptide comprising a 
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GFP or at least a portion of the green fluorescent protein that contains a functional fluorop- 
hore, and at least a portion of a biologically active polypeptide as defined herein provided 
that said fusion is not the PKCa-GFP, PKCy-GFP. and PKCe-GFP disclosed by Schmidt et 
al.and Sakai et al., respectively. Thus, GFP may be N- or C-terminally tagged to a biologi- 
5 caliy active polypeptide, optionally via a linker portion or linker peptide consisting of a sequ- 
ence of one or more amino acids. The hybrid polypeptide or fusion polypeptide may act as a 
fluorescent probe in intact living cells carrying a DNA sequence encoding the hybrid poly- 
peptide under conditions permitting expression of said hybrid polypeptide. 

10 The term "kinase" is intended to indicate an enzyme that is capable of phosphorylating a 
cellular component. 

The term "protein kinase" is intended to indicate an enzyme that is capable of phosphoryla- 
ting serine and/or threonine and/or tyrosine in peptides and/or proteins. 

15 The term "phosphatase" is intended to indicate an enzyme that is capable of dephosphoryla- 
ting phosphoserine and/or phosphothreonine and/or phosphotyrosine in peptides and/or 
proteins. 

In the present context, the term "biologically active polypeptide" is intended to indicate a po- 
20 lypeptide affecting intracellular processes upon activation, such as an enzyme which is acti- 
ve in intracellular processes or a portion thereof comprising a desired amino acid sequence 
which has a biological function or exerts a biological effect in a cellular system. In the poly- 
peptide one or several aminoacids may have been deleted, inserted or replaced to alter its 
biological function, e.g. by rendering a catalytic site inactive. Preferably, the biologically acti- 
25 ve. polypeptide is selected from the group consisting of proteins taking part in an intracellular 
signalling pathway, such as enzymes involved in the intracellular phosphorylation and dep- 
hosphorylation processes including kinases, protein kinases and phosphorylases as defined 
herein, but also proteins making up the cytoskeleton play important roles in intracellular sig- 
nal transduction and are therefore included in the meaning of "biologically active polypeptide" 
30 herein. More preferably, the biologically active polypeptide is a protein which according to its 
state as activated or non-activated changes localisation within the cell, preferably as an In- 
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termediary component in a signal transduction pathway. Included in this preferred group of 
biologically active polypeptides are cAMP dependent protein kinase A. 



10 



15 



20 



The term "a substance having biological activity" is intended to indicate any sample which 
has a biological function or exerts a biological effect in a cellular system. The sample may be 
a sample of a biological material such as a sample of a body fluid including blood, plasma, 
saliva, milk, urine, or a microbial or plant extract, an environmental sample containing pollu- 
tants including heavy metals or toxins, or it may be a sample containing a compound or 
mixture of compounds prepared by organic synthesis or genetic techniques. 

The phrase "any change in fluorescence" means any change in absorption properties, such 
as wavelength and Intensity, or any change in spectral properties of the emitted light, such 
as a change of wavelength, fluorescence lifetime, intensity or polarisation, or any change in 
the intracellular localisation of the fluorophore. It may thus be localised to a specific cellular 
component (e.g. organelle, membrane, cytoskeleton. molecular structure) or it may be 
evenly distributed throughout the cell or parts of the cell. 

The phrase "back-tracking of a signal transduction pathway" is intended to indicate. 
The term "organism" as used herein indicates any unicellular or multicellular organism prefe- 
rably originating from the animal kingdom including protozoans, but also organisms that are 
members of the plant kingdoms, such as algae, fungi, bryophytes. and vascular plants are 
included in this definition. 



25 



30 



The term "nucleic acid" is intended to indicate any type of poly- or oligonucleic acid sequen- 
ce, such as a DNA sequence, a cDNA sequence, or an RNA sequence. 

The term "biologically equivalent" as it relates to proteins is intended to mean that a first 
protein is equivalent to a second protein if the cellular functions of the two proteins may sub- 
stitute for each other, e.g. if the two proteins are closely related isoforms encoded by diffe- 
rent genes, if they are splicing variants, or allelic variants derived from the same gene, if 
they perfomi identical cellular functions in different cell types, or in different species. The 
temi -biologically equivalent" as it relates to DNA is intended to mean that a first DNA sequ- 
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ence encoding a polypeptide Is equivalent to a second DNA sequence encoding a polypepti- 
de if the functional proteins encoded by the two genes are biologically equivalent. 

The phrase "back-tracking of a signal transduction pathway" is intended to indicate a process 
5 for defining more precisely at what level a signal transduction pathway is affected, either by 
the influence of chemical compounds or a disease state in an organism. Consider a specific 
signal transduction pathway represented by the bioactive polypeptides A - B - C - D. with 
signal transduction from A towards D. When investigating ail components of this signal 
transduction pathway compounds or disease states that Influence the activity or redlstribu- 
10 tion of only D can be considered to act on C or downstream of C whereas compounds or 
disease states that influence the activity or redistribution of C and but not of A and B can 
be considered to act downstream of B. 

The term "fixed cells** is used to mean cells treated with a cytoiogical fixative such as glu- 
15 taraldehyde or formaldehyde, treatments which serve to chemically cross-link and stabilize 
soluble and insoluble proteins within the structure of the cell. Once in this state, such pro- 
teins cannot be lost from the structure of the now-dead cell. 



20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 . CHO cells expressing the PKAc-F64L-S65T-GFP hybrid protein have been treated 
In HAM'S F12 medium with 50 mM forskotin at 37°C. The images of the GFP fluorescence In 
25 these cells have been taken at different time intervals after treatment, which were: a) 40 se- 
conds b) 60 seconds c) 70 seconds d) 80 seconds. The fluorescence changes from a 
punctate to a more even distribution within the (non-nuclear) cytoplasm. 
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Figure 2. Time-lapse analysis of forskolin induced PKAc-F64L.S65T-GFP redistribution 
CHO ceils, expressing the PKAc-F64L-S65T-GFP fusion protein were analysed by time- 
lapse fluorescence microscopy. Fluorescence micrographs were acquired at regular intervals 
from 2 min before to 8 min after the addition of agonist. The cells were challenged with 1 mM 
forskolin immediately after the upper left image was acquired (t=0). Frames were collected at 
the following times: i) 0. ii) 1 . iii) 2. iv) 3. v) 4 and vi) 5 minutes. Scale bar 10 mm 
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Figure 3. Time-lapse analyses of PKAc-F64L-S65T-GFP redistribution in response to various 
agonists. The effects of 1 mM forskolin (A). 50 mM forskolin (B). 1mM dbcAMP (C) and 100 
mM IBMX (D) (additions indicated by open arrows) on the localisation of the PKAc-F64L- 
S65T-GFP fusion protein were analysed by time-lapse fluorescence microscopy of 
CHO/PKAC-F64L-S65T-GFP cells. The effect of addition of 10 mM forskolin (open arrow) 
followed shortly by repeated washing with buffer (solid arrow), on the localisation of the 
PKAC-F64L-S65T-GFP fusion protein was analysed in the same cells (E). In a parallel expe- 
riment, the effect of adding 10 mM forskolin and 100 mM IBMX (open arrow) followed by re- 
peated washing with buffer containing 100 mM IBMX (solid arrow) was analysed (F). Remo- 
ving forskolin caused PKAc-F64L-S65T-GFP fusion protein to return to the cytoplasmic ag- 
gregates while this is prevented by the continued presence of IBMX (F). The effect of 100 
nM glucagon (Fig 3G. open arrow) on the localisation of the PKAc-F64L-S65T-GFP fusion 
protein is also shown for BHK/GR. PKAc-F64L-S65T-GFP cells. The effect of 10 mM norepi- 
nephrine (H). solid anrow. on the localisation of the PKAc-F64L-S65T-GFP fusion protein 
was analysed similarty. in transiently transfected CHO. PKAc-F64L-S65T-GFP cells, pretre- 
ated with 10 mM forskolin. open arrow, to increase [cAMP], N.B. in Fig 3H the x-axis counts 
the image numbers, with 12 seconds between images. The raw data of each experiment 
consisted of 60 fluorescence micrographs acquired at regular intervals including several 
■mages acquired before the addition of buffer or agonist. The charts (A-G) each show a 
quantification of the response seen through all the 60 images, performed as described in 
analysis method 2. The change in total area of the highly fluorescent aggregates, relative to 
the initial area of fluorescent aggregates is plotted as the ordinate in all graphs in Figure 3. 
versus time for each experiment. Scale bar 10 mm. 
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Figure 4. Dose response curve (two experiments) for forskolin-induced redistribution of the 
PKAC-F64L-S65T-GFP fusion. 

Figure 5. Time from initiation of a response to half maximal (Uamax) and maximal {imax) PKAc- 
F64L-S65T-GFP redistribution. The data was extracted from curves such as that shown in 
"Figure 2." All ti/zmax and t^ax values are given as mean±SD and are based on a total of 26-30 
cells from 2-3 independent experiments for each forskolin concentration. Since the observed 
redistribution is sustained over time, the t^^^ values were taken as the eariiest time point at 
which complete redistribution is reached. Note that the values do not relate to the degree of 
redistribution. 



Figure 6. Parallel dose response analyses of forskolin induced cAMP elevation and PKAc- 
F64L-S65T-GFP redistribution. The effects of buffer or 5 increasing concentrations of forsko- 
lin on the localisation of the PKAc-F64L-S65T-GFP fusion protein in CHO/PKAc-F64L-S65T- 
GFP cells, grown in a 96 well plate, were analysed as described above. Computing the ratio 
of the SD's of fluorescence micrographs taken of the same field of cells, prior to and 30 min 
after the addition of forskolin, gave a reproducible measure of PKAc-F64L-S65T-GFP redi- 
stribution. The graph shows the individual 48 measurements and a trace of their me- 
an±s.e.m at each forskolin concentration. For comparison, the effects of buffer or 8 increa- 
sing concentrations of forskolin on [cAMPjj was analysed by a scintillation proximity assay of 
cells grown under the same conditions. The graph shows a trace of the mean ± s.e.m of 4 
experiments expressed in arbitrary units. 

Figure 7. BHK cells stably transfected with the human muscarinic (hM1) receptor and the 
PKCa-F64L-S65T-GFP fusion. Carbachol (100 mM added at 1.0 second) Induced a transient 
redistribution of PKCa-F64L-S65T-GFP from the cytoplasm to the plasma membrane. Ima- 
ges were taken at the following times: a) 1 second before carbachol addition, b) 8.8 seconds 
after addition and c) 52.8 seconds after addition. 
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Figure 8. BHK cells stably transfected with the hM1 receptor and PKCa-F64L-S65T-GFP 
fusion were treated with carbachol (1 mM. 10 mM. 100 mM). In single cells intracellular 
[Ca^-J was monitored simultaneously with the redistribution of PKCa-F64L-S65T-GFP. Da- 
shed line indicates the addition times of carbachol. The top panel shows changes in the in- 
tracellular Ca^* concentration of individual cells with time for each treatment. The middle pa- 
nel shows changes in the average cytoplasmic GFP fluorescence for individual cells against 
time for each treatment. The bottom panel shows changes in the fluorescence of the perip- 
hery of single cells, within regions that specifically include the circumferential edge of a cell 
as seen in nomial projection, the regions which offers best chance to monitor changes in the 
fluorescence intensity of the plasma membrane. 



Figure 9. a) The hERK1-F64L-S65T-GFP fusion expressed in HEK293 cells treated with 100 
mM of the MEK1 inhibitor PD98059 in HAM F-12 (without serum) for 30 minutes at 37 "C. 
> The nuclei empty of fluorescence during this treatment. 

b) The same cells as in (a) following treatment with 10 % foetal calf serum for 15 minutes at 
37 "C. 

c) Time profiles for the redistribution of GFP fluorescence in HEK293 cells following treat- 
ment with various concentrations of EGF in Hepes buffer (HAM F-12 replaced with Hepes 
buffer directly before the experiment). Redistribution of fluorescence is expressed as the 
change in the ratio value between areas in nucleus and cytoplasm of single cells. Each time 
profile is the mean for the changes seen in six single cells. 

d) Bar chart for the end-point measurements. 600 seconds after start of EGF treatments, of 
fluorescence change (nudeusrcytoplasm) following various concentrations of EGF. 

Figure 10. 

a) The SMAD2-EGFP fusion expressed in HEK293 cells starved of serum overnight in HAM 
F-12. HAM F-12 was then replaced with Hepes buffer pH 7.2 immediately before the expe- 
riment. Scale bar is 10 mm. 

b) HEK 293 cells expressing the SMAD2-EGFP fusion were treated with various concentra- 
tion of TGF-beta as indicated, and the redistribution of fluorescence monitored against time. 
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The time profile plots represent increases in fluorescence within the nucleus, normalised to 
starting values in each cell measured. Each trace Is the time profile for a single cell nucleus. 

c) A bar chart representing the end-point change in fluorescence within nuclei (after 850 se- 
conds of treatment) for different concentrations of TGF-beta. Each bar is the value for a 
5 single nucleus in each treatment. 

Figure 11. The VASP-F64L-S65T-GFP fusion in CHO cells stably transfected with the hu- 
man insulin receptor. The cells were starved for two hours in HAM F-12 without serum, then 
treated with 10% foetal calf serum. The image shows the resulting redistribution of fluore- 
10 scence after 15 minutes of treatment. GFP fluorescence becomes localised in structures 
identified as focal adhesions along the length of actin stress fibres. 

Figure 12. Time lapse recording GLUT4-GFP redistribution in CHO-HIR cells. Time indicates 
minutes after the addition of 100 nM insulin. 



15 
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EXAMPLE 1 



Construction, testing and implementation of an assay for cAMP based on PKA activation in 
real time within living cells. 

Useful for monitoring the activity of signalling pathways which lead to altered concentrations 
of cAMP. e.g. activation of G-protein coupled receptors which couple to G-protelns of the Gs 
or G, class. 



The catalytic subunit of the murine cAMP dependent protein kinase (PKAc)was fused C- 
terminally to a F64L-S65T derivative of GFP. The resulting fusion (PKAc-F64L-S65T-GFP) 
was used for monitoring in vivo the translocation and thereby the activation of PKA. 

Construction of the PKAc-F64L-S65T-GFP fusion: 

Convenient restriction endonuclease sites were introduced into the cDNAs encoding murine 
PKAc (Gen Bank Accession number: M12303) and F64L-S65T-GFP (sequence disclosed in 
WO 97/1 1094) by polymerase chain reaction (PGR). The PCR reactions were performed ac- 
cording to standard protocols with the following primers: 

5'PKAc:TTggACACAAgCTTTggACACCCTCAggATATgggCAACgCCgCCgCCgCCAAg 
(SEQ ID NO:3), 

3'PKAc: gTCATCTTCTCgAgTCTTTCAggCgCgCCCAAACTCAgTAAACTCCTTgCCACAC 
(SEQ ID NO:4) . 

5'GFP: TTggACACAAgCTTTggACACggCgCgCCATgAgTAAAggAgAAgAACTTTTC (SEQ ID 
NO:1). 

3'GFP: gTCATCTTCTCgAgTCTTACTCCTgAggTTTgTATAgTTCATCCATgCCATgT (SEQ ID 
NO:2). 
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The PKAc ampiification product was then digested with Hindlll+AscI and the F64L-S65T- 
GFP product with Ascl+XhoL The two digested PGR products were subsequently iigated 
with a Hindlll+Xho! digested plasmid (pZeoSV® mammalian expression vector, Invitrogen, 
San Diego, CA, USA). The resulting fusion construct (SEQ ID NO:68 & 69) was under con- 
5 trol of the SV40 promoter. 

Transfection and cell culture conditions. 

Chinese hamster ovary cells (CHO), were transfected with the plasmid containing the PKAc- 
F64L-S65T-GFP fusion using the calcium phosphate precipitate method in HEPES-buffered 
saline (Sambrook et al., 1989). Stable transfectants were selected using 1000 mg Zeocin/ml 

10 (Invitrogen) in the growth medium (DMEM with 1000 mg glucose/1, 10 % fetal bovine serum 
(FBS), 100 mg penicillin-streptomycin mixture mr\ 2 mM L-glutamine purchased from Life 
Technologies Inc., Gaithersburg, MD, USA). Untransfected CHO ceils were used as the 
control. To assess the effect of glucagon on fusion protein translocation, the PKAc-F64L- 
S65T-GFP fusion was stably expressed in baby hamster kidney cells overexpressing the 

15 human glucagon receptor (BHK/GR cells) Untransfected BHK/GR cells were used as the 
control. Expression of GR was maintained with 500 mg G418/ml (Neo marker) andPKAc- 
F64L-S65T-GFP was maintained with 500 mg Zeocin/ml {Sh ble marker). CHO cells were 
also simultaneously co-transfected with vectors containing the PKAc-F64L-S65T-GFP fusion 
and the human a2a adrenoceptor (hARa2a). 

20 For fluorescence microscopy, cells were allowed to adhere to Lab-Tek chambered cover- 
glasses (Nalge Nunc Int.. Naperviile, IL, USA) for at least 24 hours and cultured to about 
80% confluence. Prior to experiments, the cells were cultured over night without selection 
pressure in HAM F-12 medium with glutamax (Life Technologies), 100 mg penicillin- 
streptomycin mixture ml'^ and 0.3 % FBS. This medium has low autofluorescence enabling 

25 fluorescence microscopy of cells straight from the incubator. 



Monitoring activity of PKA activity in real time: 

Image aquisition of live cells were gathered using a Zeiss Axiovert 135M fluorescence mi- 
croscope fitted with a Fluar 40X, NA: 1.3 oil immersion objective and coupled to a Photomet- 
30 rics CH250 charged coupled device (CCD) camera. The cells were illuminated with a 100 W 
HBO arc lamp. In the light path was a 470dk20 nm excitation filter, a 510 nm dichroic mirror 
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and a 515±15 nm emission filter for minimal image bacl<ground. The cells were kept and 
monitored to be at ST'C with a custom built stage heater. 

Images were processed and analyzed in the following manner:. 

Method 1: Stepwise procedure for quantitation of translocation of PKA: 
1 The image was corrected for dark cun-ent by performing a pixel-by-pixel subtraction of a 
dark image (an image taken under the same conditions as the actual image, except the 
camera shutter is not allowed to open). 

2. The image was corrected for non-uniformity of the illumination by performing a pixel-by- 
pixel ratio with a flat field correction image (an image taken under the same conditions as the 
actual image of a uniformly fluorescent spedmen). 

3. The image histogram, i.e.. the frequency of occurrence of each Intensity value in the im- 
age, was calculated. 

4. A smoothed, second derivative of the histogram was calculated and the second zero is 
determined. This zero corresponds to the inflection point of the histogram on the high side of 
the main peak representing the bulk of the image pixel values. 

5. The value determined in step 4 was subtracted from the image. All negative values were 
discarded. 

6. The variance (square of the standard deviation) of the remaining pixel values was deter- 
mined. This value represents the "response" for that image. 

7. Scintillation proximity assay (SPA) for independent quantitation of cAMP: 
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Method 2: Alternative method for quantitation of PKA redistribution: 

1 . The fluorescent aggregates are segmented from each image using an automatically 

5 found threshold based on the maximisation of the information measure between the object 
and background. The a priori entropy of the image histogram is used as the information me- 
asure. 

2. The area of each image occupied by the aggregates is calculated by counting pixels in 
the segmented areas. 

10 3. The value obtained in step 2 for each image in a series, or treatment pair, is normalised to 
the value found for the first (unstimulated) image collected. A value of zero (0) indicates no 
redistribution of fluorescence from the starting condition. A value of one (1) by this method 
equals full redistribution. 

15 Cells were cultured in HAM F-12 medium as described above, but in 96-well plates. The me- 
dium was exchanged with Ca^*-HEPES buffer including 100 mM IBMX and the cells were 
stimulated with different concentrations of forskolin for 10 min. Reactions were stopped with 
addition of NaOH to 0.14 M and the amount of cAMP produced was measured with the 
cAMP-SPA kit, RPA538 (Amersham) as described by the manufacturer. 

20 

Manipulating intracellular levels of cAMP to test the PKAc-F64L-S65T-GFP fusion. 

The following compounds were used to vary cAMP levels: Forskolin, an activator of adeny- 
late cyclase; dbcAMP, a membrane permeable cAMP analog which is not degraded by 
phosphodiesterase; IBMX. an inhibitor of phosphodiesterase. 

25 CHO cells stably expressing the PKAc-F64L-S65T-GFP, showed a dramatic translocation of 
the fusion protein from a punctate distribution to an even distribution throughout the cyto- 
plasm following stimulation with 1 mM forskolin (n=3), 10 mM forskolin (n=4) and 50 mM 
forskolin (n=4) (Fig 1), or dbcAMP at ImM (n=6). 

Fig. 2 shows the progression of response in time following treatment with 1 mM forskolin. 
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Fig. 3 gives a comparison of the average temporal profiles of fusion protein redistribution 
and a measure of the extent of each response to the three forskolin concentrations (Fig. 3A. 
E. B). and to 1 mM dbcAMP (fig 3C) which caused a similar but slower response, and to ad- 
dition of 100 mM IBMX (n=4. Fig. 3D) which also caused a slow response, even in the ab- 
sence of adenylate cyclase stimulation. Addition of buffer (n=2) had no effect (data not 
shown). 

As a control for the behavior of the fusion protein. F64L-S65T-GFP alone was expressed in 
CHO cells and these were also given 50 mM forskolin (n=5): the uniform diffuse distribution 
characteristic of GFP in these cells was unaffected by such treatment (data not shown). 
The forskolin induced translocation of PKAC-F64L-S65T-GFP showed a dose-response re- 
lationship (Fig 4 and 6). see quantitative procedures above. 

Reversibility of PKAc-F64L-S65T-GFP translocation. 

The release of the PKAc probe from its cytoplasmic anchoring hotspots was revereible. 
Washing the cells repeatedly (5-8 times) with buffer after lOpM forskolin treatment comple- 
tely restored the punctate pattem within 2-5 min (n=2. Fig. 3E). In fact the fusion protein re- 
turned to a pattem of fluorescent cytoplasmic aggregates virtually indistinguishable from that 
observed before forskolin stimulation. 

To test whether the retum of fusion protein to the cytoplasmic aggregates reflected a decre- 
20 ased [CAMP],, cells were treated with a combination of 10 mM forskolin and 100 mM IBMX 
(n=2) then washed repeatedly (5-8 times) with buffer containing 100 mM IBMX (Fig. 3F). In 
these experiments, the fusion protein did not retum to its preslimulatory localization after 
removal of forskolin. 
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Testing the PKA-F64L-S65T-GFP probe with physiologically relevant agents. 
To test the probe's response to receptor activation of adenylate cyclase, BHK cells stably 
transfected with the glucagon receptor and the PKA-F64L-S65t-GFP probe were exposed to 
glucagon stimulation. The glucagon receptor Is coupled to a Gs protein which activates 
adenylate cyclase, thereby increasing the cAMP level, in these cells, addition of 100 nM glu- 
cagon (n=2) caused the release of the PKA-F64L-S65T-GFP probe from the cytoplasmic 
aggregates and a resulting translocation of the fusion protein to a more even cytoplasmic 
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distribution within 2-3 min (Fig. 3G). Similar but less pronounced effects were seen at lower 
glucagon concentrations (n=2, data not shown). Addition of buffer (n=2) had no effect over 
time (data not shown). 

Transiently transfected CHO cells expressing hARa2a and the PKA-F64L-S65T-GFP probe 
5 were treated with 10 mM forskolin for 7.5 minutes, then, in the continued presence of 
forskolin. exposed to 10 mM norepinephrine to stimulate the exogenous adrenoreceptors, 
which couple to a G, protein, which inhibit adenylate cyclase. This treatment led to reappear- 
ance of fluorescence in the cytoplasmic aggregates indicative of a decrease in [cAMP]i (Fig. 
3H). 

10 

Fusion protein translocation correlated with [cAMPJj 

As described above/the time it took for a response to come to completion was dependent on 
the forskolin dose (Fig. 5) In addition the degree of responses was also dose dependent. To 
test the PKA-F64L-S65T-GFP fusion protein translocation in a semi high through-put sy- 

15 stem, CHO cells stably transfected with the PKA-F64L-S65T-GFP fusion was stimulated 
with buffer and 5 increasing doses of forskolin (n=8). Using the image analysis algorithm de- 
scribed above (Method 1), a dose response relationship was observed in the range from 
0.01-50 mM forskolin (Fig. 6). A half maximal stimulation was observed at about 2 mM for- 
skolin. In parallel, cells were stimulated with buffer and 8 increasing concentrations of for- 

20 skolin (n=4) in the range 0.01-50 mM. The amount of cAMP produced was measured in an 
SPA assay. A steep increase was observed between 1 and 5 mM forskolin coincident with 
the steepest part of the curve for fusion protein translocation (also Fig. 6) 



25 EXAMPLE 2 

Quantitation of redistribution in real-time within living cells. 
Probe for detection of PKC activity in real time within living cells: 

Construction of PKC-GFP fusion: 
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The probe was constructed by ligating two restriction enzyme treated polymerase chain re- 
action (PGR) amplification products of the cDNA for murine PKCa (GenBank Accession 
number: M25811) and F64L-S65T-GFP (sequence disclosed in WO 97/11094) respectively. 
Taq® polymerase and the following oligonucleotide primers were used for PGR; 

5 5'mPKCa: TTggACACAAgGTTTggACAGGGTGAggATATggCTgACgTTTACCGggGCAACg 
(SEQ ID NO:5). 

3 mPKCa: gTCATCTTGTGgAgTGTTTGAggGgCgGCCTACTgGACTTTgCAAgATTgggTgC 
(SEQ ID NO:6), 

5-F64L-S65T-GFP:TTggACAGAAgCTTTggAGACggCgCgCCATgAgTAAAggAgAAgAACTr- 
10 TrC(SEQIDNO:1). 

3-F64L-S65T-GFP:gTGATCTTCTGgAgTCTTACTCCTgAggTTTgTATAgTTGATCCATgC- 
CATgT (SEQ ID NO:2). 

The hybrid DNA strand was inserted into the pZeoSV® mammalian expression vector as a 
Hindlll-Xhol casette as described in example 1. 

15 Cell Culture: 

BHK ceils expressing the human Ml receptor under the control of the inducible metal- 
lothionine promoter and maintained with the dihydrofolate reductase marker were trans- 
fected with the PKCa-F64L-S65T-GFP probe using the calcium phosphate precipitate 
method in HEPES buffered saline (HBS [pH 7.10]). Stable transfectants were selected using 
1000 ug Zeocln(Syml in the growth medium (DMEM with 1000 mg glucose/1. 10 % foetal bo- 
vine serum (FBS). 100 mg penicillin-streptomycin mixture ml-1, 2 mM l-glutamine). The hMI 
receptor and PKCa-F64L-S65T-GFP fusion protein were maintained with 500 nM 
methotrexate and 600 pg Zeodn®/ml respectively. 24 hours prior to any experiment, the 
cells were transferred to HAM F-12 medium with glutamax. 100 pg penicillin-streptomycin 
mixture ml ' and 0.3 % FBS. This medium relieves selection pressure, gives a low Induction 
of signal transduction pathways and has a low autofluorescence at the relevant wavelength 
enabling fluorescence microscopy of cells straight from the incubator. 

Monitoring the PKC activity in real time: 

Digital images of live cells were gathered using a Zeiss Axiovert 135M fluorescence micro- 
30 scope fitted with a 40X. NA: 1 .3 oil immersion objective and coupled to a Photometries 
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CH250 charged coupled device (CCD) camera. The cells were Illuminated with a 100 W arc 
lamp. In the light path was a 470±20 nm excitation filter, a 510 nm dichroic mirror and a 
515±15 nm emission filter for minimal image background. The cells were kept and monitored 
to be at 37°C with a custom built stage heater. 

5 Images were analyzed using the IPLab software package for Macintosh. 

Upon stimulation of the M1-BHK cells, stably expressing the PKCa-F64L-S65T-GFP fusion, 
with carbachol we observed a dose-dependent transient translocation from the cytoplasm to 
the plasma membrane (Fig. 7a,b.c). Simultaneous measurement of the cytosolic free cal- 
cium concentration shows that the carbachol-induced calcium mobilisation precedes the 
10 translocation (Fig, 8). 



Stepwise procedure for quantitation of translocation of PKC: 

1 . The image was corrected for dark current by performing a pixel-by-pixel subtraction of a 
dark image (an image taken under the same conditions as the actual image, except the 
camera shutter Is not allowed to open). 

15 2. The image was corrected for non-uniformity of the illumination by performing a pixel-by- 
pixel ratio with a flat field correction image (an image taken under the same conditions as the 
actual image of a uniformly fluorescent specimen). 

3. A copy of the image was made in which the edges are identified. The edges in the image 
are found by a standard edge-detection procedure - convolving the image with a kernel 

20 which removes any large-scale unchanging components (i.e., background) and accentuates 
any small-scale changes (i.e., sharp edges). This image was then converted to a binary im- 
age by threshholding. Objects in the binary image which are too small to represent the 
edges of cells were discarded. A dilation of the binary image was performed to close any 
gaps in the image edges. Any edge objects in the image which were in contact with the bor- 

25 ders of the image are discarded. This binary image represents the edge mask. 

4. Another copy of image was made via the procedure in step 3. This copy was further 
processed to detect objects which enclose "holes" and setting all pixels inside the holes to 
the binary value of the edge, i.e., one. This image represents the whole cell mask. 

5- The original image was masked with the edge mask from step 3 and the sum total of all 
30 pixel values is determined. 
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6. The original image was masked with the whole cell mask from step 4 and the sum total 
of all pixel values was determined. 

7. The value from step 5 was divided by the value from step 6 to give the final result, the 
fraction of fluorescence intensity in the cells which was localized in the edges. 



EXAMPLE 3 

Probes for detection of mitogen activated protein kinase Erki redistribution. 

Useful for monitoring signalling pathways involving MAPK. e.g. to identify compounds which 
modulate the activity of the pathway in living cells. 

Erki. a serine/threonine protein kinase, is a component of a signalling pathway which is acti- 
vated by e.g. many growth factors. 



Probes for detection of ERK-1 activity in real time within living cells: 

The extracellular signal regulated kinase (ERK-1. a mitogen activated protein kinase. MAPK) 
is fused N- or C-terminally to a derivative of GFP. The resulting fusions expressed in differ- 
ent mammalian cells are used for monitoring in vivo the nuclear translocation, and thereby 
the activation, of ERK1 in response to stimuli that activate the MAPK pathway. 

a) Construction of murine ERK1 - F64L-S65T-GFP fusion: 

Convenient restriction endonuclease sites are introduced into the cDNAs encoding murine 
ERK1 (GenBank Accession number: Z14249) and F64L-S65T-GFP (sequence disclosed in 
WO 97/11094) by polymerase chain reaction (PCR). The PGR reactions are performed ac- 
cording to standard protocols with the following primers: 

5'ERK1 TTggACACAAgCTTTggACACCCTCAggATATggCggCggCggCggCggCTCCggggg- 
gCgggg (SEQ id N0:7), 
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3'ERK1 : gTCATCTTCTCgAgTCTTTCAggCgCgCCCggggCCCTCTggCgCCCCTggCTgg 
(SEQ ID NO:8), 

5'F64L-S65T-GFP: TTggACACAAgCTTTggACACggCgCgCCATgAgTAAAggAgAAgAACTT- 
TTC (SEQ ID NO:1) 

5 3'F64L-S65T-GFP: gTCATCTTCTCgAgTCTTACTCCTgAggTTTgTATAgTTCATCCATgC- 
CATgT (SEQ ID NO:2) 

To generate the mERK1-F64L-S65T-GFP (SEQ ID NO:56 & 57) fusion the ERK1 amplifica- 
tion product is digested with Hindlll+AscI and the F64L-S65T-GFP product with Ascl+Xhol. 
To generate the F64L-S65T-GFP-nnERK1 fusion the ERK1 amplification product is then di- 
10 gested with Hindlll+Bsu36l and the F64L-S65T-GFP product with Bsu36l+Xhol.The two 
pairs of digested PGR products are subsequently ligated with a Hindill+Xhol digested pias- 
mid (pZeoSV® mammalian expression vector, Invitrogen, San Diego, OA. USA). The resul- 
ting fusion constructs are under control of the SV40 promoter. 



15 b) The human Erk1 gene (GenBank Accession number: X60188) was amplified using PGR 
according to standard protocols with primers Erkl-top (SEQ ID NO:9) and Erk1- 
bottom/+stop (SEQ ID NO: 10) . The PGR product was digested with restriction enzymes E- 
coR1 and BamH1, and ligated into pEGFP-CI (Clontech, Palo Alto; GenBank Accession 
number U55763) digested with EcoR1 and BamH1. This produces an EGFP-Erk1 fusion 

20 (SEQ ID NO:38 &39) under the control of a CMV promoter. 



The piasmid containing the EGFP-Erki fusion was transfected into HEK293 cells employing 
the FUGENE transfection reagent (Boehringer Mannheim). Prior to experiments the cells 
were grown to 80%-90% confluency 8 well chambers in DMEM with 10% FCS. The cells we- 

25 re washed in plain HAM F-12 medium (without FCS), and then incubated for 30-60 minutes 
in plain HAM F-12 (without FCS) with 100 micromolar PD98059, an inhibitor of MEK1 . a ki- 
nase which activates Erk1; this step effectively empties the nucleus of EGFP-Erki. Just be- 
fore starting the experiment, the HAM F-12 was replaced with Hepes buffer following a wash 
with Hepes buffer. This removes the PD98059 inhibitor; if blocking of MEK1 is still wanted 

30 (e.g. in control experiments), the inhibitor is included In the Hepes buffer. 

The experimental setup of the microscope was as described in example 1. 
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60 images were collected with 10 seconds between each, and with the test compound added 
after image number 10. 



Addition of EGF (1-100 nM) caused within minutes a redistribution of EGFP-Erki from the 
cytoplasm into the nucleus (Fig. 9a,b). 

The response was quantitated as described below and a dose-dependent relationship bet- 
ween EGF concentration and nuclear translocation of EGFP-Erki was found (Fig. 9c.d). Re- 
ditribution of GFP fluorescence is expressed in this example as the change in the ratio value 
between areas in nuclear versus cytoplasmic compartments of the cell. Each time profile is 
the average of nuclear to cytoplasmic ratios from six cells in each treatment. 

EXAMPLE 4: 

Probes for detection of Erk2 redistribution. 

Useful for monitoring signalling pathways involving MAPK, e.g. to identify compounds which 
nradulate the activity of the pathway in living cells. 

Erk2. a serine/threonine protein kinase, is closely related to Erki but not identical; It is a 
component of a signalling pathway which is activated by e.g. many growth factors. 

a) The rat Erk2 gene (GenBank Accession number: M64300) was amplified using PGR ac- 
cording to standard protocols with primers Erk2-top (SEQ ID NO:1 1) and Erk2-bottom/+stop 
(SEQ ID NO:13) The PGR product was digested with restriction enzymes Xhol and BamHI. 
and ligated into pEGFP-G1 (Clontech. Palo Alto; GenBank Accession number U55763) dige- 
sted with Xho1 and BamHI. This produces an EGFP-Erk2 fusion (SEQ ID NO:40 &41) under 
the control of a GMV promoter. 

b) The rat Erk2 gene (GenBank Accession number M64300) was amplified using PGR ac- 
cording to standard protocols with primers (SEQ ID NO:11) Erk2-top and Erk2-bottom/-stop 
(SEQ ID NO:12). The PGR product was digested with restriction enzymes Xhol and BamHI. 
and ligated into pEGFP-NI (Clontech. Palo Alto; GenBank Accession number U55762) dige- 
sted with Xhol and BamHI. This produces an Erk2-EGFP fusion (SEQ ID NO:58 &59) under 
the control of a CMV promoter. 
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The resulting plasmlds were transfected into CHO cells and BHK cells. The cells were grown 
under standard conditions. Prior to experiments, the cells were starved in nnedium without 
serum for 48-72 hours. This led to a predominantly cytoplasmic localization of both probes, 
5 especially in BHK cells. 10% fetal calf serum was added to the cells and the fluorescence of 
the cells was recorded as explained in example 3. Addition of serum caused the probes to 
redistribute into the nucleus within minutes of addition of serum. 

EXAMPLE 5: 

10 Probes for detection of Smad2 redistribution. 

Useful for monitoring signalling pathways activated by some members of the transforming 
growth factor-beta family, e.g. to Identify compounds which modulate the activity of the pa- 
thway in living cells. 

Smad 2, a signal transducer, is a component of a signalling pathway which is induced by 
1 5 some members of the TGFbeta family of cytokines. 

a) The human Smad2 gene (GenBank Accession number: AF027964) was amplified using 
PGR according to standard protocols with primers Smad2-top (SEQ ID NO:24) and Smad2- 
bottom/+stop (SEQ ID NO:26) . The PGR product was digested with restriction enzymes E- 

20 coR1 and Acc65l, and ligated into pEGFP-GI (Glontech; Palo Alto; GenBank Accession 
number U55763) digested with EcoRI and Acc65l. This produces an EGFP-Smad2 fusion 
(SEQ ID NO:50&51) under the control of a GMV promoter. 

b) The human Smad2 gene (GenBank Accession number: AF027g64) was amplified using 
PGR according to standard protocols with primers Smad2-top (SEQ ID NO:24) and Smad2- 

25 bottom/-stop (SEQ ID NO:25) . The PGR product was digested with restriction enzymes E- 
coRI and Acc65l, and ligated into pEGFP-NI (Glontech, Palo Alto; GenBank Accession 
number U55762) digested with EcoRI and Acc65l. This produces a Smad2-EGFP fusion 
(SEQ ID NO:74 &75) under the control of a GMV promoter. 

30 The plasmid containing the EGFP-Smad2 fusion was transfected into HEK293 cells, where it 
showed a cytoplasmic distribution. Prior to experiments the cells were grown in 8 well Nunc 
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Chambers in DMEM with 10% FCS to 80% confiuency and starved overnight in HAM F-12 
medium without FCS. 

For experiments, the HAM F-12 medium was replaced with Hepes buffer pH 7.2. 

The experimental setup of the microscope was as described in example 1 . 

90 images were collected with 10 seconds between each, and with the test compound added 
after image number 5. 



After serum starvation of cells, each nucleus contains less GFP fluorescence than the sur- 
rounding cytoplasm (Fig. 10a). Addition of TGFbeta caused within minutes a redistribution of 
EGFP-Srfiad2 from the cytoplasma into the nucleus (Fig. 10b), 

The redistribution of fluorescence within the treated cells was quantified simply as the fracti- 
onal increase in nuclear fluorescence normalised to the starting value of GFP fluorescence 
in the nucleus of each unstimulated cell. 

EXAMPLE 6: 

Probe for detection of VASP redistribution. 

Useful for monitoring signalling pathways involving rearrangement of cytoskeletal elements, 
e.g. to identify compounds which modulate the activity of the pathway in living cells. 

VASP. a phosphoprotein. is a component of cytoskeletal structures, which redistributes in 
response to signals which affect focal adhesions. 

a) The human VASP gene (GenBank Accession number: Z46389) was amplified using PGR 
according to standard protocols with primers VASP-top (SEQ ID NO:94) and VASP- 
botfomZ-^stop (SEQ ID NO:95). The PGR product was digested with restriction enzymes 
Hind3 and BamH1. and ligated into pEGFP-CI (Clontech. Palo Alto; GenBank Accession 
number U55763) digested with Hind3and BamHI. This produces an EGFP-VASP fusion 
(SEQ ID NO: 124 &125) under the control of a CMV promoter. 

The resulting plasmid was transfected into CHO cells expressing the human insulin receptor 
using the calcium-phosphate transfection method. Prior to experiments, cells were grown in 
8 well Nunc chambers and starved overnight In medium without FCS. 
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Experiments are performed in a microscope setup as described in example 1 . 

10% FCS was added to the cells and images were collected. The EGFP-VASP fusion was 
redistributed from a somewhat even distribution near the periphery into more localized 
structures, identified as focal adhesion points (Fig. 11). 

5 

A large number of further GFP fusions have been made or are in the process of being made, 
as apparent from the following Examples 7-22 which also suggest suitable host cells and 
substances for activation of the cellular signalling pathways to be monitored and analyzed. 

10 

EXAMPLE 7: 

Probe for detection of actin redistribution. 

1 5 Useful for monitoring signalling pathways involving rearrangement or formation of actin fila- 
ments, e.g. to identify compounds which modulate the activity of pathways leading to cyto- 
skeletal rearrangements in living cells. 

Actin is a component of cytoskeletal structures, which redistributes in response to very many 
cellular signals. 

20 

The actin binding domain of the human alpha-actinin gene (GenBank Accession number: 
XI 5804) was amplified using PGR according to standard protocols with primers ABD-top 
(SEQ ID NO:90) and ABD-botlom/-stop (SEQ ID NO:91). The PGR product was digested 
with restriction enzymes Hind3 and BamH1, and ligated into pEGFP-N1 (Clontech, Palo Alto; 
25 GenBank Accession number U55762) digested with Hind3 and BamH1 . This produced an 
actin-binding-domain-EGFP fusion (SEQ ID NO: 128 &129) under the control of a CMV pro- 
moter. 

The resulting plasmid was transfected into CHO cells expressing the human insulin receptor. 
Cells were stimulated with Insulin which caused the actio binding domaln-EGFP probe to be- 
30 come redistributed into morphologically distinct membrane-associated structures. 
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Example 8: 

Probes for detection of p38 redistribution. 

Usefu! for monitoring signalling pathways responding to various cellular stress situations, 
e.g. to identify compounds which modulate the activity of the pathway in living cells, or as a 
counterscreen. 

p38. a serine/thronine protein kinase, is a component of a stress-induced signalling pathway 
which is activated by many types of cellular stress, e.g. TNFalpha. anisomycln. UV and mi- 
tomycin C. 



a) The human p38 gene (GenBank Accession number: L35253) was amplified using PGR 
according to standard protocols with primers p38-top (SEQ ID NO:14) and p38- 
bottom/+stop (SEQ ID NO: 16). The PGR product was digested with restriction enzymes 
Xho1 and BamH1. and ligated into pEGFP-C1 (Glontech. Palo Alto; GenBank Accession 
number U55763) digested with Xhol and BamH1. This produced an EGFP-p38 fusion (SEQ 
ID NO:46 &47) under the control of a CI\4V promoter. 

b) The human p38 gene (GenBank Accession number L35253) was amplified using PGR 
according to standard protocols with primers p38-top (SEQ ID NO:13) and p38-bottom/-stop 
(SEQ ID NO:15) . The PGR product was digested with restriction enzymes Xho1 and 
BamH1 , and ligated into pEGFP-N1 (Glontech. Palo Alto; GenBank Accession number 
U55762) digested with Xho1 and BamHI. This produced a p38-EGFP fusion (SEQ ID NO:64 
&65) under the control of a GMV promoter. 

The resulting plasmids are transfected into a suitable cell line. e.g. HEK293. in which the 
EGFP-P38 probe and/or the p38-EGFP probe should change its cellular distributton from 
predominantly cytoplasmic to nuclear within minutes in response to activation of the signal- 
ling pathway with e.g. anisomycin. 



Example 9: 

Probes for detection of Jnki redistribution. 
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Useful for monitoring signalling pathways responding to various cellular stress situations, 
e.g. to identify compounds which modulate the activity of the pathway in living cells, or as a 
counterscreen. 

Jnk1, a serine/threonine protein kinase, is a component of a stress-induced signalling pa- 
5 thway different from the p38 described above, though it also is activated by many types of 
cellular stress, e.g. TNFalpha, anisomycin and UV. 



a) The human Jnk1 gene (GenBank Accession number: L26318) was amplified using PGR 
according to standard protocols with primers Jnk-top (SEQ ID NO: 17) and Jnk-bottom/+stop 

10 (SEQ ID NO: 19) . The PGR product was digested with restriction enzymes Xho1 and 
BamHI , and ligated into pEGFP-CI (Glontech, Palo Alto; GenBank Accession number 
U55763) digested with Xho1 and BamHI. This produced an EGFP-Jnki fusion (SEQ ID 
NO:44 &45) under the control of a CMV promoter. 

b) The human Jnk1 gene (GenBank Accession number: L26318) was amplified using PGR 
15 according to standard protocols with primers Jnk-top (SEQ ID NO:17) and Jnk-bottom/-stop 

(SEQ ID NO:18) . The PGR product was digested with restriction enzymes Xho1 and 
BamHI, and ligated Into pEGFP-NI (Glontech, Palo Alto; GenBank Accession number 
U55762) digested with Xhol and BamHI. This produced a Jnkl-EGFP fusion (SEQ ID 
NO:62 &63) under the control of a CMV promoter. 

20 The resulting plasmids are transfected into a suitable cell line, e.g. HEK293, in which the 
EGFP-Jnk1 probe and/or the Jnk1-EGFP probe should change its cellular distribution from 
predominantly cytoplasmic to nuclear in response to activation of the signalling pathway with 
e.g. anisomycin. 



Example 10: 

Probes for detection of PKG redistribution. 

Useful for monitoring signalling pathways involving changes in cyclic GMP levels, e.g. to 
identify compounds which modulate the activity of the pathway in living cells. 

30 PGK. a cGMP-dependent serine/threonine protein kinase, mediates the guanylyl- 
cyclase/cGMP signal. 
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a) The human PKG gene (GenBank Accession number: Y07512) is amplified using PGR ac- 
cording to standard protocols with primers PKG-top (SEQ ID NO:81) and PKG-bottom/+stop 
(SEQ ID NO:83) . The PCR product is digested with restriction enzymes Xho1 and BamH1. 
and ligated into pEGFP-C1 (Clontech. Palo Alto; GenBank Accession number U55763) dige- 
sted with Xhol and BamHI. This produces an EGFP-PKG fusion (SEQ ID NO:134 &135) 
under the control of a CMV promoter. 

b) The human PKG gene (GenBank Accession number: Y07512) is amplified using PCR ac- 
cording to standard protocols with primers PKG-top (SEQ ID NO:81) and PKG-bottom/-stop 
(SEQ ID NO: 82) . The PCR product is digested with restriction enzymes Xhol and BamHI. 
and ligated into pEGFP-NI (Clontech. Palo Alto; GenBank Accession number U55762) dige- 
sted with Xhol and BamHI. This produces a PKG-EGFP fusion (SEQ ID NO:136 &137) un- 
der the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line. e.g. A10. in which the EGFP- 
PKG probe and/or the PKG-EGFP probe should change its cellular distribution from cyto- 
plasmic to one associated with cytoskeletal elements within minutes in response to treatment 
with agents which raise nitric oxide (NO) levels. 



Example 11: 

20 Probes for detection of IkappaB kinase redistribution. 

Useful for monitoring signalling pathways leading to NFkappaB activation, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 
IkappaB kinase, a serine/threonine kinase, is a component of a signalling pathway which is 
activated by a variety of inducers including cytokines, lymphokines. growth factors and 
25 stress. 



30 



a) The alpha subunit of the human IkappaB kinase gene (GenBank Accession number: 
AF009225) is amplified using PCR according to standard protocols with primers IKK-top 
(SEQ ID NO:96) and IKK-bottom/+stop (SEQ ID NO:98). The PCR product is digested with 
restriction enzymes EcoR1 and Acc65l. and ligated into pEGFP-CI (Clontech. Palo Alto; 
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GenBank Accession number U55763) digested with EcoRland Acc65I. This produces an 
EGFP-lkappaB-klnase fusion (SEQ ID NO:120 &121) under the control of a CMV promoter, 

b) The alpha subunit of the human ikappaB kinase gene (GenBank Accession number: 
AF009225) is amplified using PGR according to standard protocols with primers IKK-top 
(SEQ ID NO:96) and IKK-bottomAstop (SEQ ID NO:97). The PGR product is digested with 
restriction enzymes EcoR1 and Acc65l. and ligated into pEGFP-N1 (Clontech, Palo Alto; 
GenBank Accession number U55762) digested with EcoR1 and Acc65L This produces an 
IkappaB-kinase-EGFP fusion (SEQ ID NO: 122 &123) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-lkappaB-kinase probe and/or the IkappaB-kinase-EGFP probe should achieve a more 
cytoplasmic distribution within seconds following stimulation with e.g. TNFalpha. 

Example 12: 

Probes for detection of CDK2 redistribution. 

Useful for monitoring signalling pathways of the cell cycle, e.g. to identify compounds which 
modulate the activity of the pathway in living cells. 

CDK2. a cyclin-dependent serine/threonine kinase, is a component of the signalling system 
which regulates the cell cycle. 

a) The human CDK2 gene (GenBank Accession number: X61622) is amplified using PGR 
according to standard protocols with primers CDK2-top (SEQ ID NO: 102) and CDK2- 
bottom/+stop (SEQ ID NO: 104). The PGR product is digested with restriction enzymes Xhol 
and BamHI, and ligated into pEGFP-C1 (Glontech, Palo Alto; GenBank Accession number 
U55763) digested with Xhol and BamHI. This produces an EGFP-CDK2 fusion (SEQ ID 
NO:1 14 &1 1 5) under the control of a CMV promoter. 

b) The human CDK2 gene (GenBank Accession number: X61622) is amplified using PGR 
according to standard protocols with primers CDK2-top (SEQ ID NO: 102) and CDK2- 
bottom/-stop (SEQ ID NO:103). The PGR product is digested with restriction enzymes Xho1 
and BamHI, and ligated into pEGFP-NI (Clontech, Palo Alto; GenBank Accession number 
U55762) digested with Xho1 and BamHI. This produces a CDK2-EGFP fusion (SEQ ID 
NO:1 12 &1 13) under the control of a CMV promoter. 
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The resulting plasmids are transfected into a suitable cell line. e.g. HEK293 in which the 
EGFP-CDK2 probe and/or the CDK2-EGFP probe should change its cellular distribution 
from cytoplasmic in contact-inhibited cells, to nuclear location in response to activation with a 
number of growth factors, e.g. IGF. 



Example 13: 

Probes for detection of Grk5 redistribution. 



re- 



Useful for monitoring signalling pathways involving desensitization of G-protein coupled i 
ceptors, e.g. to identify compounds which modulate the activity of the pathway in living cells. 

Grk5, a G-protein coupled receptor kinase, is a component of signalling pathways involving 
membrane bound G-protein coupled receptors. 



a) The human Grk5 gene (GenBank Accession number: L15388) is amplified using PGR ao 
cording to standard protocols with primers Grk5-top (SEQ ID NO:27) and Grk5- 
bottom/+stop (SEQ ID NO:29). The PGR product is digested with restriction enzymes EcoR1 
and BamHI, and ligated into pEGFP-G1 (Clontech. Palo Alto; GenBank Accession number 
U55763) digested with EcoR1 and BamHI. This produces an EGFP-Grk5 fusion (SEQ ID 
NO:42 &43) under the control of a GMV promoter. 

b) The human Grk5 gene (GenBank Accession number: L15388) is amplified using PGR ac- 
cording to standard protocols with primers Grk5-top (SEQ ID NO:27) and Grk5-bottom/-stop 
(SEQ ID NO:28). The PGR product is digested with restriction enzymes EcoR1 and BamHI, 
and ligated into pEGFP-N1 (Clontech, Palo Alto; GenBank Accession number U55762) digj- 
sted with EcoR1 and BamHI . This produces a Grk5-EGFP fusion (SEQ ID NO:60 &61) un- 
der the control of a GMV promoter. 

The resulting plasmids are transfected info a suitable cell line, e.g. HEK293 expressing a rat 
dopamine D1 A receptor, in which the EGFP-GrkS probe and/or the Grk5-EGFP probe 
should change its cellular distribution from predominantly cytoplasmic to peripheral in re- 
sponse to activation of the signalling pathway with e.g. dopamine. 



Example 14: 
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Probes for detection of Zap70 redistribution. 

Useful for monitoring signalling pathways involving the T cell receptor, e.g. to identify com- 
pounds which modulate the activity of the pathway in living cells. 

Zap70, a tyrosine kinase, is a component of a signalling pathway which is active in e.g. T-cell 
5 differentiation. 



a) The human Zap70 gene (GenBank Accession number: L05148) is amplified using PGR 
according to standard protocols with primers Zap70-top (SEQ ID NO:105) and Zap70- 
bottom/+stop (SEQ ID NO: 107). The PGR product is digested with restriction enzymes E- 

10 coR1 and BamH1, and ligated into pEGFP-CI (GenBank Accession number U55763) dige- 
sted with EcoR1 and BamH1. This produces an EGFP-2ap70 fusion (SEQ ID NO:108 &109) 
under the control of a GMV promoter. 

b) The human Zap70 gene (GenBank Accession number: L05148) is amplified using PGR 
according to standard protocols with primers Zap70-top (SEQ ID NO: 105) and Zap70- 

15 bottom/-stop (SEQ ID NO: 106). The PGR product is digested with restriction enzymes E- 
coRI and BamHI, and ligated into pEGFP-N1 (Glontech, Palo Alto; GenBank Accession 
number U55762) digested with EcoR1 and BamHI. This produces a Zap70-EGFP fusion 
(SEQ ID NO:1 10 &1 1 1 ) under the control of a GMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
20 EGFP-Zap70 probe and/or the Zap70-EGFP probe should change its cellular distribution 
from cytoplasmic to membrane-associated within seconds in response to activation of the T 
cell receptor signalling pathway with e.g. antibodies to GDSepsilon. 



Example 15: 

25 Probes for detection of p85 redistribution. 

Useful for monitoring signalling pathways involving PI-3 kinase, e.g. to identify compounds 
which modulate the activity of the pathway in living cells. 

p85alpha is the regulatory subunit of PI3-kinase which is a component of many pathways 
involving membrane-bound tyrosine kinase receptors and G-protein-coupled receptors. 
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a) The human p85alpha gene (GenBank Accession number: M61906) was amplified using 
PGR according to standard protocols with primers p85-top-C (SEQ ID NO:22) and p85- 
bottom/+stop (SEQ ID NO:23) . The PGR product was digested with restriction enzymes 
Bgl2 and BamH1. and ligated into pEGFP-GI (Glontech. Palo Alto; GenBank Accession 
number U55763) digested with Bgl2 and BamHI. This produced an EGFP-p85alpha fusion 
(SEQ ID NO:48 &49) under the control of a CMV promoter. 

b) The human p85alpha gene (GenBank Accession number: M61906) was amplified using 
PGR according to standard protocols with primers p85-top-N (SEQ ID NO:20) and p85- 
bottom/-stop (SEQ ID NO:21) . The PGR product was digested with restriction enzymes E- 
coRl and BamHI, and ligated into pEGFP-N1 (Glontech. Palo Alto; GenBank Accession 
number U55762) digested with EcoRI and BamHI. This produced a p85alpha-EGFP fusion 
(SEQ ID NO:66 &67) under the control of a GMV promoter. 

The resulting plasmids are transfected into a suitable cell line. e.g. CHO expressing the hu- 
man insulin receptor, in which the EGFP-p85 probe and/or the pSS-EGFP probe may change 
its cellular distribution from cytoplasmic to membrane-associated within minutes in response 
to activation of the receptor with insulin. 

Example 16: 

Probes for detection of protein-tyrosine phosphatase redistribution. 

Useful for monitoring signalling pathways involving tyrosine kinases, e.g. to identify com- 
pounds which modulate the activity of the pathway in living cells. 

Protein-tyrosine phosphatase! C. a tyrosine-specific phosphatase, is an inhibitory compo- 
nent in signalling pathways involving e.g. some growth factors. 

a) The human protein-tyrosine phosphatase 1C gene (GenBank Accession number: X62055) 
Is amplified using PGR according to standard protocols with primers PTP-top (SEQ ID 
NO:99) and PTP-bottom/+stop (SEQ ID NO:101). The PGR product is digested with restricti- 
on enzymes Xho1 and EcoR1 , and ligated into pEGFP-C1 (Glontech, Palo Alto; GenBank 
Accession number U55763) digested with Xho1 and EcoR1. This produces an EGFP-PTP 
fusion (SEQ ID NO:1 16 &1 17) under the control of a GMV promoter. 
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b) The human protein-tyrosine phosphatase 1C gene (GenBank Accession number: X62055) 
is amplified using PGR according to standard protocols with primers PTP-top (SEQ ID 
NO:99) and PTP-bottom/-stop (SEQ ID NO: 100). The PGR product is digested with restricti- 
on enzymes Xho1 and EcoR1, and ligated into pEGFP-N1 (Glontech. Palo Alto; GenBank 
Accession number U55762) digested with Xho1 and EcoR1 . This produces a PTP-EGFP 
fusion (SEQ ID NO: 11 8 &1 19) under the control of a GMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. MGF-7 in which the 
EGFP-PTP probe and/or the PTP-EGFP probe should change its cellular distribution from 
cytoplasm to the plasma menbrane within minutes in response to activation of the growth 
inhibitory signalling pathway with e.g. somatostatin. 

Example 17: 

Probes for detection of Smad4 redistribution. 

Useful for monitoring signalling pathways involving most members of the transforming 
growth factor-beta family, e.g. to identify compounds which modulate the activity of the pa- 
thway in living cells. 

Smad4, a signal transducer, is a common component of signalling pathways induced by va- 
rious members of the TGFbeta family of cytokines. 

a) The human Smad4 gene (GenBank Accession number: U44378) was amplified using 
PGR according to standard protocols with primers Smad4-top and Smad4-bottom/+stop 
(SEQ ID NO:35) . The PGR product was digested with restriction enzymes EcoRI and 
BamH1, and ligated into pEGFP-C1 (Glontech, Palo Alto; GenBank Accession number 
U55763) digested with EcoRI and BamH1. This produce an EGFP-Smad4 fusion (SEQ ID 
NO:52 &53) under the control of a GMV promoter. 

b) The human Smad4 gene (GenBank Accession number: U44378) was amplified using 
PGR according to standard protocols with primers Smad4-top (SEQ ID NO:33) and Smad4- 
bottom/-stop (SEQ ID NO:34). The PGR product was digested with restriction enzymes E- 
coR1 and BamH1, and ligated into pEGFP-N1 (Glontech, Palo Alto; GenBank Accession 
number U55762) digested with EcoRI and BamHI. This produced a Smad4-EGFP fusion 
(SEQ ID NO:76 &77) under the control of a GMV promoter. 
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The resulting plasmlds are transfected into a cell line. e.g. HEK293 in which the EGFP- 
Smad4 probe and/or the Smad4-EGFP probe should change Its cellular distribution within 
minutes from cytoplasmic to nuclear in response to activation of the signalling pathway with 
e.g. TGFbeta. 

5 

Example 18: 

Probes for detection of Stat5 redistribution. 

Useful for monitoring signalling pathways involving the activation of tyrosine kinases of the 
Jak family, e.g. to identify compounds which modulate the activity of the pathway in living 
10 cells. 

Stats, signal transducer and activator of transcription, is a component of signalling pathways 
which are induced by e.g. many cytokines and growth factors. 



a) The human StatS gene (GenBank Accession number: L41 142) was amplified using PGR 
15 according to standard protocols with primers Stat5-top (SEQ ID NO:30) and StatS- 

bottom/+stop (SEQ ID NO:32). The PGR product was digested with restriction enzymes Bgl2 
and Acc65l. and ligated into pEGFP-CI (Clontech; Palo Alto; GenBank Accession number 
U55763) digested with Bgl2 and Acc65l. This produced an EGFP-Stat5 fusion (SEQ ID 
NO:54 &55) under the control of a CMV promoter. 

20 b) The human StatS gene (GenBank Accession number: L41 142) was amplified using PGR 
according to standard protocols with primers StatS-top (SEQ ID NO:30) and StatS-bottom/- 
stop (SEQ ID NO:331). The PGR product was digested with restriction enzymes Bgl2 and 
Acc65l, and ligated into pEGFP-NI (Clontech, Palo Alto; GenBank Accession number 
U55762) digested with Bgl2 and Acc65l. This produced a StatS-EGFP fusion (SEQ ID NO:78 

25 &79) under the control of a CMV promoter. 

The resulting plasmlds are transfected into a suitable cell line, e.g. MIN6 in which the EGFP- 
Stats probe and/or the StatS-EGFP probe should change its cellular distribution from cyto- 
plasmic to nuclear within minutes in response to activation signalling pathway with e.g. pro- 
lactin. 
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Example 19: 

Probes for detection of NFAT redistribution. 

Useful for monitoring signalling pathways involving activation of NFAT, e.g. to identify com- 
pounds which modulate the activity of the pathway in living cells. 

5 NFAT, an activator of transcription. Is a component of signalling pathways which is involved 
in e.g. immune responses. 



a) The human NFAT1 gene (GenBank Accession number: U43342) is amplified using PGR 
according to standard protocols with primers NFAT-top (SEQ ID NO:84) and NFAT- 

10 bottom/+stop (SEQ ID NO?86). The PGR product is digested with restriction enzymes Xho1 
and EcoR1 , and ligated Into pEGFP-C1 (Clontech, Palo Alto; GenBank Accession number 
U55763) digested with Xhol and EcoR1 . This produces an EGFP-NFAT fusion (SEQ ID 
NO: 130 &131) under the control of a GMV promoter. 

b) The human NFAT gene (GenBank Accession number: U43342) is amplified using PGR 
15 according to standard protocols with primers NFAT-top (SEQ ID NO:84) and NFAT-bottom/- 

stop (SEQ ID NO:85). The PGR product is digested with restriction enzymes Xho1 and E- 
coRI, and ligated into pEGFP-NI (Clontech, Palo Alto; GenBank Accession number 
U55762) digested with Xhol and EcoR1. This produces an NFAT-EGFP fusion (SEQ ID 
NO:132 &133) under the control of a GMV promoter 

20 The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-NFAT probe and/or the NFAT-EGFP probe should change Its cellular distribution 
from cytoplasmic to nuclear within minutes in response to activation of the signalling pathway 
with e.g. antibodies to CD3epsilon. 



25 Example 20: 

Probes for detection of NFkappaB redistribution. 

Useful for monitoring signalling pathways leading to activation of NFkappaB, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 
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NFkappaB. an activator of transcription, Is a component of signalling pathways which are 
responsive to a varity of inducers including cytokines, lymphokines. some immunosuppressi- 
ve agents. 



a) The human NFkappaB p65 subunit gene (GenBank Accession number: M62399) is am- 
plified using PGR according to standard protocols with primers NFkappaB-top (SEQ ID 
NO:87) and NFkappaB-bottom/+stop (SEQ ID NO:89). The PGR product is digested with 
restriction enzymes Xhol and BamH1, and ligated into pEGFP-G1 (Glontech. Palo Alto; 
GenBank Accession number U55763) digested with Xho1 and BamHI. This produces an 
EGFP-NFkappaB fusion (SEQ ID NO:142 & 143) under the control of a CMV promoter. 

b) The human NFkappaB p65 subunit gene (GenBank Accession number: M62399) is am- 
plified using PGR according to standard protocols with primers NFkappaB-top (SEQ ID 
NO:87) and NFkappaB-bottom/-stop (SEQ ID NO:88). The PGR product is digested with re- 
striction enzymes Xhol and BamHI. and ligated into pEGFP-NI (Glontech. Palo Alto; Gen- 
Bank Accession number U55762) digested with Xhol and BamHI. This produces an 
NFkappaB-EGFP fusion (SEQ ID NO:140 & 141) under the control of a GMV promoter. 
The resulting plasmids are transfected Into a suitable cell line. e.g. Jurt<at. In which the 
EGFP-NFkappaB probe and/or the NFkappaB-EGFP probe should change its cellular distri- 
bution from cytoplasmic to nuclear in response to activation of the signalling pathway with 
e.g. TNFalpha. 

Example 21: 

Probe for detection of RhoA redistribution. 

Useful for monitoring signalling pathways involving RhoA. e.g. to identify compounds which 
modulate the activity of the pathway in living cells. 

RhoA. a small GTPase, is a component of many signalling pathways, e.g. LPA induced cyto- 
skeietal rearrangements. 



The human RhoA gene (GenBank Accession number: L25080) was amplified using PGR 
according to standard protocols with primers RhoA-top (SEQ ID NO:92) and RhoA- 
bottom/+stop (SEQ ID NO:93). The PGR product was digested with restriction enzymes 
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Hind3 and BamH1. and ligated into pEGFP-C1 (Clontech, Palo Alto; GenBank Accession 
number U55763) digested with Hind3and BamH1. This produced an EGFP-RhoA fusion 
(SEQ ID NO:126 &127) under the control of a CMV promoter. 

The resulting plasmid is transfected into a suitable cell line. e.g. SwissSTS, In which the 
5 EGFP-RhoA probe should change Its cellular distribution from a reasonably homogenous to 
a peripheral distribution within minutes of activation of the signalling pathway with e.g. LPA. 

Example 22: 

Probes for detection of PKB redistribution. 

Useful for monitoring signalling pathways involving PKB e.g. to Identify compounds which 
10 modulate the activity of the pathway in living cells. 

PKB. a serine/threonine kinase, is a component in various signalling pathways, many of 
which are activated by growth factors. 



a) The human PKB gene (GenBank Accession number: M63167) is amplified using PGR ac- 
15 cording to standard protocols with primers PKB-top (SEQ ID NO:36) and PKB-bottom/+stop 
(SEQ ID NO:80) . The PGR product is digested with restriction enzymes Xhol and BamHI. 
and ligated into pEGFP-CI (Clontech, Palo Alto; GenBank Accession number U55763) dige- 
sted with Xho1 and BamHI. This produces an EGFP-PKB fusion (SEQ ID NO:138 & 139) 
under the control of a CMV promoter. 

20 b) The human PKB gene (GenBank Accession number: M63167) was amplified using PGR 
according to standard protocols with primers PKB-top (SEQ ID NO:36) and PKB-bottom/- 
stop (SEQ ID NO:37) . The PGR product was digested with restriction enzymes Xho1 and 
BamHI, and ligated into pEGFP-NI (Clontech, Palo Alto; GenBank Accession number 
U55762) digested with Xho1 and BamHI. This produced a PKB-EGFP fusion (SEQ ID 

25 NO:70 &71) under the control of a CMV promoter. 

The resulting plasmids are transfected Into a suitable cell line, e.g. CHO expressing the hu- 
man insulin receptor, in which the EGFP-PKB probe and/or the PKB-EGFP probe cycles 
between cytoplasmic and membrane locations during the activation-deactivation process 
following addition of insulin. The transition should be apparent within minutes. 
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SEQUENCE LISTING 



10 



15 



20 



25 



30 



35 



40 



(1) GENERAL INFORMATION 
(i) APPLICANT: NovoNordisk, Biolmage 

(ii) TITLE OF THE INVENTION: A Method of Detecting Cellular 

Translocation of Biologically Active Polypeptides Using 
Fluorescense Imaging 

(iii) NUMBER OF SEQUENCES: 143 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NovoNordisk, Biolmage 

(B) STREET: Morkhojbygade 28 

(C) CITY: Seborg 

(D) STATE: DK 

(E) COUNTRY: DENMARK 

(F) ZIP: 2860 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: , PV&P R 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ACAA GCTTTGGACA CGGCGCGCCA TGAGTAAAGG AGAAGAACTT TTC 53 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GTCATCTTCT CGAGTCTTAC TCCTGAGGTT TGTATAGTTC ATCCATGCCA TGT 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGGCAACG CCGCCGCCGC CAAG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHTUIACTERISTICS : 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GTCATCTTCT CGAGTCTTTC AGGCGCGCCC AAACTCAGTA AACTCCTTGC CACAC 55 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGCTGACG TTTACCCGGC CAACG 55 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GTCATCTTCT CGAGTCTTTC AGGCGCGCCC TACTGCACTT TGCAAGATTG GGTGC 55 
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(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGCGGCGG CGGCGGCGGC TCCGGGGGGC 60 
GGGG 64 



(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GTCATCTTCT CGAGTCTTTC AGGCGCGCCC GGGGCCCTCT GGCGCCCCTG GCTGG 55 
(2) INFORMATION FOR SEQ ID NO: 9: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

40 TAGAATTCAA CCATGGCGGC GGCGGCGGCG 30 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TAGGATCCCT AGGGGGCCTC CAGCACTCC 29 
55 (2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11; 
TACTCGAGTA ACCATGGCGG CGGCGGCGGC G 

31 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 12 : 
TAGGATCCAT AGATCTGTAT CCTGG 

25 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 
TAGGATCCTT AAGATCTGTA TCCTGG 

26 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATCTCGAGGG AAAATGTCTC AGGAGAGG 

28 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



10 



15 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATGGATCCTC GGACTCCATC TCTTCTTG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGGATCCTC AGGACTCCAT CTCTTCTTG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



28 



29 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GTCTCGAGCC ATCATGAGCA GAAGCAAG 

(2) INFORMATION FOR SEQ ID NO: 18: 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



28 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTGGATCCCA CTGCTGCACC TGTGCTA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
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GTGGATCCTC ACTGCTGCAC CTGTGCTA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGCGAATTCC GCCACCATGA GTGCTGAGGG GTACCAGTAC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGCGGATCCT GTCGCCTCTG CTGTGCATAT AC 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingl e 

(D) TOPOLOGY: linear 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: p85-top-C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GGGAGATCTA TGAGTGCTGA GGGGTACCAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GGGCGGATCC TCATCGCCTC TGCTGTGCAT ATAC 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTGAATTCGA CCATGTCGTC CATCTTGCCA TTC 
15 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GTGGTACCCA TGACATGCTT GAGCAACGCA C 

(2) INFORMATION FOR SEQ ID NO: 26: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



40 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 26: 
GTGGTACCTT ATGACATGCT TGAGCAACGC AC 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GTGAATTCGT CAATGGAGCT GGAAAACATC G 

(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH; 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTGGATCCCT GCTGCTTCCG GTGGAGTTCG 

30 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTGGATCCCT AGCTGCTTCC GGTGGAGTTC G 

31 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTAGATCTAC CATGGCGGGC TGGATCCAGG CC 

32 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTGGTACCCA TGAGAGGGAG CCTCTGGCAG A 

31 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



15 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32; 

GTGGTACCTC ATGAGAGGGA GCCTCTGGCA G 
V 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
GTGAATTCAA CCATGGACAA TATGTCTATT ACG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



31 



33 



30 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GTGGATCCCA GTCTAAAGGT TGTGGGTCTG C 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GTGGATCCTC AGTCTAAAGG TTGTGGGTCT GC 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



31 



32 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
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66 

GTCTCGAGGC ACCATGAGCG ACGTGGC 

27 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TGGGATCCGA GGCCGTGCTG CTGGCCG 
t 27 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1891 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



mI^ '"'^'^ ^'"'^ GGG GTG GTG CCC ATC CTG 

Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val vlt Pro lie Leu 
^ 10 



48 

5 Leu 

15 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 
val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser vTl sS aiy 
2a 25 



30 



^ Z TZ -I S Z'. S^.^.^ - 



40 45 



^. S ?S S S 5S ^ S ?S i^. 



55 60 



Su tly Tn 11"" 

Tnr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



96 



144 



192 



240 



75 80 



2^ 2t -I I- -I r Z I- Zt X 

90 95 
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CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAG AAG ACC CGC GCC GAG 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

5 GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr beu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG 432 
10 He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
15 145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

20 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

25 CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
30 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
35 225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCA ACC ATG GCG GCG GCG 768 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ala Ala Ala 
245 250 255 

40 

GCG GCT CAG GGG GGC GGG GGC GGG GAG CCC CGT AGA ACC GAG GGG GTC 816 

Ala Ala Gin Gly Gly Gly Gly Gly Glu Pro Arg Arg Thr Glu Gly Val 
260 265 270 

45 GGC CCG GGG GTC CCG GGG GAG GTG GAG ATG GTG AAG GGG CAG CCG TTC 864 
Gly Pro Gly Val Pro Gly Glu Val Glu Met Val Lys Gly Gin Pro Phe 
275 280 285 

GAC GTG GGC CCG CGC TAC ACG CAG TTG CAG TAC ATC GGC GAG GGC GCG 912 
50 Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr He Gly Glu Gly Ala 
290 295 300 

TAC GGC ATG GTC AGC TCG GCC TAT GAC CAC GTG CGC AAG ACT CGC GTG 960 
Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val Arg Lys Thr Arg Val 
55 305 310 315 320 
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s s c» 

-0 S S S S - s S S S - s 21? ^ 

380 

i - =s s s r s jL= z s 

^.^ 400 

460 

CCA GAG ATC ATG CTG AAC TCP aar- 

- 0.. S 3^ - - - - - 

4 75 

480 

TGG TCT GTG GGC TGC ATT CTr nn-v 

Trp se. val o^y cys S ^2 SI ^Ju JI? l^r T 

4 85 Asn Arg Pro lie 

490 

s 0=.- ^. 2! s - 

540 



1056 



1104 



1200 



1248 



1296 



1536 



1584 
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GCC AAG CTT TTC CCC AAG TCA GAG TCC AAA GCC CTT GAG CTG CTG GAG 1680 

Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala Leu Asp Leu Leu Asp 
545 550 555 560 

5 CGG ATG TTA ACC TTT AAC CCC AAT AAA CGG ATC ACA GTG GAG GAA GCG 172 B 

Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie Thr Val Glu Glu Ala 
565 570 575 

CTG GCT CAC CCC TAG CTG GAG CAG TAG TAT GAC CGG ACG GAT GAG CCA 1776 

10 Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Thr Asp Glu Pro 
580 585 590 

GTG GCC GAG GAG CCC TTC ACC TTC GCC ATG GAG CTG GAT GAC CTA CCT 1824 

Val Ala Glu Glu Pro Phe Thr Phe Ala Met Glu Leu Asp Asp Leu Pro 
15 595 600 605 

AAG GAG CGG CTG AAG GAG CTC ATC TTC CAG GAG ACA GCA CGC TTC CAG 1872 

Lys Glu Arg Leu Lys Glu Leu lie Phe Gin Glu Thr Ala Arg Phe Gin 

610 615 620 



20 



25 



CCC GGA GTG CTG GAG GCC C CCTAG 18 96 

Pro Gly Val Leu Glu Ala Pro 
625 630 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 631 amino acids 

30 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
35 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 





Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly 


Val 


Val 


Pro 


He 


Leu 


40 


1 








5 










10 










15 






Val 


Glu 


Leu 


Asp 
20 


Gly 


Asp 


Val 


Asn 


Gly 
25 


His 


Lys 


Phe 


Ser 


Val 
30 


Ser 


Gly 




Glu 


Gly 


Glu 
35 


Gly 


Asp 


Ala 


Thr 


Tyr 
40 


Gly 


Lys 


Leu 


Thr 


Leu 
45 


Lys 


Phe 


He 


45 


Cys 


Thr 
50 


Thr 


Gly 


Lys 


Leu 


Pro 
55 


Val 


Pro 


Trp 


Pro 


Thr 
60 


Leu 


Val 


Thr 


Thr 




Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 
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70 










75 










80 




Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly Tyr 


Val 


Gin 


Glu 


50 










85 










90 










95 






Arg 


Thr 


He 


Phe 
100 


Phe 


Lys 


Asp 


Asp 


Gly 
105 


Asn 


Tyr 


Lys 


Thr 


Arg 
110 


Ala 


Glu 




Val 


Lys 


Phe 
115 


Glu 


Gly 


Asp 


Thr 


Leu 
120 


Val 


Asn 


Arg 


■ He 


Glu 
125 


Leu 


Lys 


Gly 


55 


He 


Asp 
130 


Phe 


Lys 


Glu 


Asp 


Gly 
135 


Asn 


He 


Leu 


Gly 


His 
14 0 


Lys 


Leu 


Glu 


Tyr 
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Asn Tyr Asn Ser His Asn Val Tvr He MP^ ^ t 
145 ^ ^ ^sp Lys Gin Lys Asn 



Oly lie Lys Val Asn Phe Lys He Arg Hi. Asn He Glu Asp Gly Ser 



160 



165 

170 



val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Gly 

Pro val Leu Leu Pro Asp Asn His JJr Leu Ser Thr Gin III Ala Leu 

200 205 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

val Thr Ala Ala Gly He HI Leu Gly „et Asp Olu Leu Tyr Lys Ser 



Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn III Thr Met Ala Ala III 



245 250 



Ala Ala Gin Gly Gly Gly Gly Gly Glu ;ro Arg Arg Thr Glu g" Val 

265 

Gly Pro Gly Val Pro Gly Glu Val Glu Met Val Lys Gly Gin Pro Phe 

280 285 
ASP val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr He aly Glu Gly Ala 

Tyr Gly Met Val Ser Ser All ryr Asp His Val 11^ Lys Thr Arg Val 

Ala He Lys Lys lie Ser Pro Phe Glu His Gl'n Thr Tyr Cys Gin I^g 

Thr Leu Arg Glu He Gin He Leu Leu l^g Phe Arg His Glu Asn Val 

lie Gly lie Arg Asp He Leu Arg 111 ser Thr Leu Glu "a Met Arg 

ASP val Tyr He Val Gin Asp III Met Glu Thr Asp ^eu Tyr Lys Leu 



380 



Leu Lys ser Gin Gin Leu Ser Asn Asp His He Cys Tyr Phe Leu ^r 



390 



— - 395 
Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu hTs 

410 



Arg ASP Leu Lys Pro Ser Asn Leu Leu Ser Asn Thr Thr Cys Asp Leu 

Lys He cys Asp Phe Gly Leu Ala S He Ala Asp Pro Oil His Asp 

His Thr Gly Phe Leu Thr Glu "^l Val Ala Thr Arg Z Tyr Arg Ala 

460 

Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp He 



475 



Trp ser Val Gly Cys He Leu Ala Glu Met Leu Ser Asn Arg Pro He" 

4 85 



490 



Phe Pro Gly Lys His Tyr Leu Asp Gin Z^u Asn His He Leu Gly He 



505 



Leu Gly ser Pro Ser Gin Glu Asp L^;! Asn Cys He He til Met Lys 



520 



Ala Arg Asn Tyr Leu Gin Ser llu Pro Ser Lys Thr l^l Val Ala Trp 

540 

Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala Leu Asp Leu Leu Asp 

Arg Met Leu Thr Phe Asn Pro Asn Lys Arg H^e Thr Val Glu Glu III 

570 

Leu Ala His pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Thr Asp gIu Pro 



585 



590 
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Val Ala Glu Glu Pro Phe Thr Phe Ala Met Glu Leu Asp Asp Leu Pro 

595 600 605 

Lys Glu Arg Leu Lys Glu Leu He Phe Gin Glu Thr Ala Arg Phe Gin 
610 615 620 

5 Pro Gly Val Leu Glu Ala Pro 
625 630 

(2) INFORMATION FOR SEQ ID NO: 40: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: CDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
20 (B) LOCATION: 1. . .1815 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

25 ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
30 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

40 

CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

45 CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC T^C TAC AAG ACC CGC GCC GAG 3 36 

50 Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 XIO 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
55 115 120 125 
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ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
He ASP Phe Lys Glu Asp Gly Asn He Leu Gly His I.ys Su gJu 



"5 



AAC TAC AAC AGC CAC AAC GTC TAT ATC ATO GCC GAC AAG CAG AAG AAC 

Asn Tyr Asn Ser His Asn Val Tvr He M«^ ai= » r , 

145 '^^ Ala Asp Lys Gin Lys Asn 

"° "5 160 

Z S ^. S - J- J- s z s s 

170 

Sn Su 111 ^^"^ '''^^ "^^^ '^'^^ GGC 

val Gin Leu Ala Asp Hxs Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

185 190 

Pro ?Il Su p"""" f ''^^ ■'^^ ^ -rCC GCC CTG 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met VaJ Su Leu pJe 

220 

?S 111 S G?v rT TH^ "° 
^al Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

"° 235 240 

^ ill f '''''' GCG GCG GGC CCG 

Gly Leu Arg Ser Arg Val Thr Met Ala Ala Ala Ala Ala Ala Gly Pro 

2« 250 255 

gJu f T ^■'^ TAC ACT AAT 

Glu Met val Arg Gly Gin Val Phe Asp Val Gly Pro Arg Tyr Thr 

265 270 

CTC TCG TAC ATC GGA GAA GGC GCC TAC GGC ATG GTT TGT TCT GCT TAT 
Leu ser Tyr He Gly Glu Gly Ala Tyr Gly Met Val s" SI ^ 

280 285 

GAT AAT CTC AAC AAA GTT CGA GTT GCT ATC AAG AAA ATC AGT CCT TTT 
ASP Asn Leu Asn Lys Val Arg Val Ala He Lys Lys He Ser Pro 



300 



GAG CAC CAG ACC TAC TGT CAG AGA ACC CTG AGA GAG ATA AAA ATC CTA 
Glu H.S Gin Thr Tyr Cys Gin Arg Thr .eu Arg Glu S ^ Til llu 

315 320 

III Arg III ^rt His T T T -<^- 
u Arg Phe Arg Has Glu Asn He He Gly He Asn Asp He He Arg 

aS Tro ?hr tH T ^ ^™ GAC CTC 

Ala Pro Thr He Glu Gin Met Lys Asp Val Tyr He Val Gin Asp Leu 

345 350 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 
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ATG GAG ACA GAT CTT TAG AAG CTC TTG AA6 ACA CAG CAC CTC AGC AAT 1104 

Met Glu Thr Asp Leu Tyr Lys Leu Leu Lys Thr Gin His Leu Ser Asn 

355 360 365 

5 GAT CAT ATC TGC TAT TTT CTT TAT CAG ATC CTG AG A GGA TTA AAG TAT 1152 
Asp His lie Cys Tyr Phe Leu Tyr Gin lie Leu Arg Gly Leu Lys Tyr 
370 375 380 

ATA CAT TCA GCT AAT GTT CTG CAC CGT GAC CTC AAG CCT TCC AAC CTC 1200 
10 lie His Ser Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu 
385 390 395 400 

CTG CTG AAC ACC ACT TOT GAT CTC AAG ATC TGT GAC TTT GGC CTT GCC 1248 
Leu Leu Asn Thr Thr Cys Asp Leu Lys lie Cys Asp Phe Gly Leu Ala 
15 405 410 415 

CGT GTT GCA GAT CCA GAC CAT GAT CAT ACA GGG TTC TTG ACA GAG TAT 12 96 

Arg Val Ala Asp Pro Asp His Asp His Thr Gly Phe Leu Thr Glu Tyr 

420 425 430 

20 

GTA GCC ACG CGT TGG TAC AGA GCT CCA GAA ATT ATG TTG AAT TCC AAG 1344 

Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu lie Met Leu Asn Ser Lys 

435 440 445 

25 GGT TAT ACC AAG TCC ATT GAT ATT TGG TCT GTG GGC TGC ATC CTG GCA 13 92 

Gly Tyr Thr Lys Ser lie Asp lie Trp Ser Val Gly Cys lie Leu Ala 
450 455 460 

GAG ATG CTA TCC AAC AGG CCT ATC TTC CCA GGA AAG CAT TAC CTT GAC 1440 
30 Glu Met Leu Ser Asn Arg Pro lie Phe Pro Gly Lys His Tyr Leu Asp 
465 470 475 480 

CAG CTG AAT CAC ATC CTG GGT ATT CTT GGA TCT CCA TCA CAG GAA GAT 1488 
Gin Leu Asn His lie Leu Gly lie Leu Gly Ser Pro Ser Gin Glu Asp 
35 485 490 495 

CTG AAT TGT ATA ATA AAT TTA AAA GCT AGA AAC TAT TTG CTT TCT CTC 1536 
Leu Asn Cys lie lie Asn Leu Lys Ala Arg Asn Tyr Leu Leu Ser Leu 
500 505 510 

40 

CCG CAC AAA AAT AAG GTG CCG TGG AAC AGG TTG TTC CCA AAC GCT GAC 1584 
Pro His Lys Asn Lys Val Pro Trp Asn Arg Leu Phe Pro Asn Ala Asp 
515 520 525 

45 TCC AAA GCT CTG GAT TTA CTG GAT AAA ATG TTG ACA TTT AAC CCT CAC 1632 
Ser Lys Ala Leu Asp Leu Leu Asp Lys Met Leu Thr Phe Asn Pro His 
530 535 540 

AAG AGG ATT GAA GTT GAA CAG GCT CTG GCC CAC CCG TAC CTG GAG CAG 1680 
50 Lys Arg lie Glu Val Glu Gin Ala Leu Ala His Pro Tyr Leu Glu Gin 
545 550 555 560 

TAT TAT GAC CCA AGT GAT GAG CCC ATT GCT GAA GCA CCA TTC AAG TTT 1728 
Tyr Tyr Asp Pro Ser Asp Glu Pro He Ala Glu Ala Pro Phe Lys Phe 
55 565 570 575 
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A^^ m''^ n^^ r'^'' "'"'^ ^'"'^ ™ ^« CTC AAA GAA CTC ATT 

ASP Met Glu Leu Asp Asp Leu Pro Lys Glu Lys Leu Lys Glu Leu ill 

585 

TTT GAA GAG ACT GCT CGA TTC CAG CCA GGA TAG AGA TCT TAA 
Phe Glu Glu Thr Ala Arg Phe Gin Pro Gly Tyr Arg Ser 

600 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

5 10 -1 5 

val Glu Leu ASP Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

Glu Gly Glu Gly Asp Ala Thr Tyr lly Lys Leu Thr Leu Its Phe He 

Cys Thr Thr Gly Lys Leu Pro til Pro Trp Pro Thr tlu Val Thr Thr 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

Gin His ASP Phe Phe Lys Ser Ala Met Pro gL Gly Tyr Val Gin gL 

Arg Thr He Phe Phe Lys Asp Asp Gly IL Tyr Lys Thr Arg "a Glu 

105 1 
val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Le^ Lys Gly 

He ASP Phe Lys Glu Asp Gly Isn He Leu Gly His lyl Leu Glu Tyr 

135 140 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

Gly He Lys Val Asn Phe Lys He Arg His As" He Glu Asp Gly Ser 

val Gin Leu Ala Asp His Tyr Gin Gin III Thr Pro He Gly Asp Gly 

185 I9n 
Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 
Ser Lys Asp Pro Asn Glu Lys Ara Ast^ h4 o m->«- t7-.-i t 

^ ^^P Met Val Leu Leu Glu Phe 

215 220 
val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

Gly Leu Arg Ser Arg Val Thr Met Ala Ala a" Ala Ala Ala Gly P^o 

Glu Met val Arg Gly Gin Val Phe Asp III aiy Pro Arg Tyr "r Asn 

265 270 



1776 



1818 
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10 



15 



20 



25 



30 



35 



40 



45 
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75 



Leu 


Ser 


Tyr 


He 


Gly Glu 


Gly Ala 


Tyr 


Gly Met 


Val 


Cys 


Ser 


Ala 


Tyr 






275 










280 










285 








Asp 


Asn 


Leu 


Asn 


Lys 


Val 


Arg 


Val 


Ala 


He 


Lys 


Lys 


He 


Ser 


Pro 


Phe 




290 










295 










300 










Glu 


His 


Gin 


Thr 


Tyr 


Cys 


Gin Arg 


Thr 


Leu 


Arg 


Glu 


He 


Lys 


He 


Leu 


305 










310 










315 










320 


Leu 


Arg 


Phe 


Arg 


His 


Glu 


Asn 


He 


He 


Gly 


He 


Asn Asp 


He 


He 


Arg 










325 










330 










335 




Ala 


Pro 


Thr 


He 


Glu 


Gin 


Met 


Lys 


Asp 


Val 


Tyr 


He 


Val 


Gin 


Asp 


Leu 








340 










345 










350 






Met 


Glu 


Thr 


Asp 


Leu 


Tyr 


Lys 


Leu 


Leu 


Lys 


Thr 


Gin 


His 


Leu 


Ser 


Asn 






355 










360 










365 








Asp 


His 


He 


Cys 


Tyr 


Phe 


Leu 


Tyr 


Gin 


He 


Leu 


Arg 


Gly 


Leu 


Lys 


Tyr 




370 










375 










380 










He 


His 


Ser 


Ala 


Asn 


Val 


Leu 


His 


Arg 


Asp 


Leu 


Lys 


Pro 


Ser 


Asn 


Leu 


385 










390 










395 










400 


Leu 


Leu 


Asn 


Thr 


Thr 


Cys 


Asp 


Leu 


Lys 


He 


Cys 


Asp 


Phe 


Gly 


Leu 


Ala 










405 










410 










415 




Arg 


Val 


Ala 


Asp 


Pro 


Asp 


His 


Asp 


His 


Thr 


Gly 


Phe 


Leu 


Thr 


Glu 


Tyr 








420 










425 










430 






Val 


Ala 


Thr 


Arg 


Trp 


Tyr 


Arg 


Ala 


Pro 


Glu 


He 


Met 


Leu 


Asn 


Ser 


Lys 






435 










440 










445 








Gly Tyr 


Thr 


'Lys 


Ser 


He 


Asp 


He 


Trp 


Ser 


Val 


Gly 


Cys 


He 


Leu 


Ala 




450 










455 










460 










Glu 


Met 


Leu 


Ser 


Asn 


Arg 


Pro 


He 


Phe 


Pro 


Gly 


Lys 


His 


Tyr 


Leu 


Asp 


465 










470 










475 










480 


Gin 


Leu 


Asn 


His 


He 


Leu 


Gly 


He 


Leu Gly Ser 


Pro 


Ser 


Gin 


Glu 


Asp 










485 










490 










495 




Leu 


Asn 


Cys 


He 


He 


Asn 


Leu 


Lys 


Ala 


Arg 


Asn 


Tyr 


Leu 


Leu 


Ser 


Leu 








500 










505 










510 






Pro 


His 


Lys 


Asn 


Lys 


val 


Pro 


Trp 


Asn 


Arg 


Leu 


Phe 


Pro 


Asn 


Ala 


Asp 






515 










520 










525 








Ser 


Lys 


Ala 


Leu 


Asp 


Leu 


Leu 


Asp 


Lys 


Met 


Leu 


Thr 


Phe 


Asn 


Pro 


His 




530 










535 










540 










Lys 


Arg 


He 


Glu 


Val 


Glu 


Gin 


Ala 


Leu 


Ala 


His 


Pro 


Tyr 


Leu 


Glu 


Gin 


545 










550 










555 










560 


Tyr 


Tyr 


Asp 


Pro 


Ser 


Asp 


Glu 


Pro 


He 


Ala 


Glu 


Ala 


Pro 


Phe 


Lys 


Phe 










565 










570 










575 




Asp 


Met 


Glu 


Leu 


Asp 


Asp 


Leu 


Pro 


Lys 


Glu 


Lys 


Leu 


Lys 


Glu 


Leu 


He 








580 










585 










590 






Phe 


Glu 


Glu 


Thr 


Ala 


Arg 


Phe 


Gin 


Pro 


Gly 


Tyr 


Arg 


Ser 












595 










600 










605 









(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 52 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
50 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

55 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2526 



75 
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76 

(D) OTHER INFORMATION: 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



15 



- 21 S -I S S - 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu JJe 
" 40 45 

Z S ^ ™ s - - - ^ - - 

?s s?.^ z. s s s 



75 



Sn Hi« f TAC GTC CAG GAG 
Gin HX8 ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val oTn Glu 



85 



90 



S Jhr ne pJe Z"" "'^^ '^^'^ °AG 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

105 



110 



?I? z ?s zi r s s ? s ;^ 



125 



ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
ne ASP Phe Lys Glu Asp Gly Asn xle Leu Gly His Hu G^u Sr 



"5 



J^n 0°"" "^"^ '^'^ <^CC GAC AAG CAG AAG AAC 

Asn TVr Asn Ser His Asn Val Tyr He Met Ala Asp Lys oil 

150 



160 



S ill til f ir GGC AGC 

Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

1" 170 

vl? ""^^ CAG AAC ACC CCC ATC GGC GAC GGC 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He fly 2p tly 



185 



190 



CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC era 
pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Sn ^r 111 S 

200 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



205 
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AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

5 GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG TCA ATG GAG CTG GAA 768 
10 Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ser Met Glu Leu Glu 

245 250 255 

AAC ATC GTG GCC AAC ACG GTC TTG CTG AAA GCC AGG GAA GGG GGC GGA 816 
Asn lie Val Ala Asn Thr Val Leu Leu Lys Ala Arg Glu Gly Gly Gly 
15 260 265 270 

GGA AAG CGC AAA GGG AAA AGC AAG AAG TGG AAA GAA ATC CTG AAG TTC 864 

Gly Lys Arg Lys Gly. Lys Ser Lys Lys Trp Lys Glu lie Leu Lys Phe 
275 280 285 

20 

CCT CAC ATT AGC CAG TGT GAA GAC CTC CGA AGG ACC ATA GAC AGA GAT 912 

Pro His lie Ser Gin Cys Glu Asp Leu Arg Arg Thr lie Asp Arg Asp 

290 295 300 

25 TAC TGC AGT TTA TGT GAC AAG CAG CCA ATC GGG AGG CTG CTT TTC CGG 960 
Tyr Cys Ser Leu Cys Asp Lys Gin Pro lie Gly Arg Leu Leu Phe Arg 
305 310 315 320 

CAG TTT TGT GAA ACC AGG CCT GGG CTG GAG TGT TAC ATT CAG TTC CTG 1008 
30 Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr lie Gin Phe Leu 

325 330 335 

GAC TCC GTG GCA GAA TAT GAA GTT ACT CCA GAT GAA AAA CTG GGA GAG 1056 
Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu Lys Leu Gly Glu 
35 340 345 350 

AAA GGG AAG GAA ATT ATG ACC AAG TAC CTC ACC CCA AAG TCC CCT GTT 1104 

Lys Gly Lys Glu lie Met Thr Lys Tyr Leu Thr Pro Lys Ser Pro Val 

355 360 365 

40 

TTC ATA GCC CAA GTT GGC CAA GAC CTG GTC TCC CAG ACG GAG GAG AAG 1152 

Phe lie Ala Gin Val Gly Gin Asp Leu Val Ser Gin Thr Glu Glu Lys 
370 375 380 

45 CTC CTA CAG AAG CCG TGC AAA GAA CTC TTT TCT GCC TGT GCA CAG TCT 120 0 

Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala Cys Ala Gin Ser 
385 390 395 400 

GTC CAC GAG TAC CTG AGG GGA GAA CCA TTC CAC GAA TAT CTG GAC AGC 124 8 

50 Val His Glu Tyr Leu Arg Gly Glu Pro Phe. His Glu Tyr Leu Asp Ser 

405 410 415 

ATG TTT TTT GAC CGC TTT CTC CAG TGG AAG TGG TTG GAA AGG CAA CCG 1296 
Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu Glu Arg Gin Pro 
55 420 425 430 
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vll ?Jr ir ""^^ ^ 

435 "^^^ ""^9 ^9 Val Leu Gly Lys Gly Gly 



440 



445 



Glv otu vll t^"" """^ '^^^ ^"^^ ATG TAT 

Phe Gly Glu val Cys Ala Cys Gin Val Arg Ala Thr Gly Lys Met Tyr 

45= 460 

A^f f ° ""^^ "^"^ ^ AAA GGG GAG 

Ala cys Lys Arg Leu Glu Lys Lys Arg He Lys Lys Arg Lys Gly Glu 

475 480 

Ser f ^ IT 7"" ""^^ GTC AAC AGT CAG 

Ser Met Ala Leu Asn Glu Lye Gin He Leu Glu Lys Val Asn Ser Gin 

48= 490 

vI? V^^' '''''' '"'^^ GCA CTG TGC 

Phe val val Asn Leu Ala Tyr Ala Tyr Glu Thr Lys Asp Ala Leu Cys 

500 505 

vl^" 7° ^u"" ''''^ '^'^ '^'^ '^T^ TTC CAC ATC TAC 

Leu val Leu Thr He Met Asn Gly Gly Asp Leu Lys Phe His He IVr 
515 520 

AAC ATG GGC AAC CCT GGC TTC GAG GAG GAG CGG GCC TTG TTT TAT GCG 
Asn Met Gly Asn Pro Gly Phe Glu Glu Glu Arg Ala Sa 

535 540 

Ala Glu He Leu Cys Gly Leu Glu Asp Leu His Arg Glu Asn Thr Val 
^ "° 555 560 

?Jr So Af r° ^ """^ ^^'^ TAT GGC CAC 

Tyr Arg Asp Leu Lys Pro Glu Asn He Leu Leu Asp Asp Tyr Gly His 
565 570 

xTe Ara H^ T"" ™ ^'^^ ^° ^-^^ ^CC GAG GGA GAC 

He Arg He Ser Asp Leu Gly Leu Ala Val Lys He Pro Glu Gly Asp 
580 585 590 

Su He Itn of °" ^-^^ GAA GTC 

Leu He Arg Gly Arg Val Gly Thr Val Gly Tyr Met Ala Pro Glu Val 
595 600 go5 

Su ""^ '■''^ ^""^ ™C °GC CTT GGC 

Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr Trp Gly Leu Gly 

g20 

TGC CTC ATC TAT GAG ATG ATC GAG GGC CAG TCG CCG TTC CGC GGC CGT 
Cys Leu He Tyr Glu Met He Glu Gly Gin Ser Pro Phe Arg Gly Arg 

635 640 

AAG GAG AAG GTG AAG CGG GAG GAG GTG GAC CGC CGG GTC CTG GAG ACG 
Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg Val Leu G^u rlr 

650 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 
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GAG GAG GTG TAG TCC CAC AAG TTC TCC GAG GAG GCC AAG TCC ATC TGC 2016 
Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala Lys Ser lie Cys 
660 665 670 

5 AAG ATG CTG CTC ACG AAA GAT GCG AAG CAG AGG CTG GGC TGC GAG GAG 2064 
Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu Gly Cys Gin Glu 
675 680 685 

GAG GGG GCT GCA GAG GTC AAG AGA CAC CCC TTC TTC AGG AAC ATG AAC 2112 
10 Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe Arg Asn Met Asn 
690 695 700 

TTC AAG CGC TTA GAA GCC GGG ATG TTG GAC CCT CCC TTC GTT CCA GAC 2160 
Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro Phe Val Pro Asp 
15 705 710 715 720 

CCC CGC GCT GTG TAG TGT AAG GAC GTG CTG GAC ATC GAG CAG TTC TCC 2208 

Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp lie Glu Gin Phe Ser 
725 730 735 

20 

ACT GTG AAG GGC GTC AAT CTG GAC CAC ACA GAC GAC GAC TTC TAC TCC 22 56 

Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp Asp Phe Tyr Ser 
740 745 750 

25 AAG TTC TCC ACG GGC TCT GTG TCC ATC CCA TGG CAA AAC GAG ATG ATA 23 04 

Lys Phe Ser Thr Gly Ser Val Ser lie Pro Trp Gin Asn Glu Met He 
755 760 765 

GAA ACA GAA TGC TTT AAG GAG CTG AAC GTG TTT GGA CCT AAT GGT ACC 2352 
30 Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly Pro Asn Gly Thr 
770 775 780 

CTC CCG CCA GAT CTG AAC AGA AAC CAC CCT CCG GAA CCG CCC AAG AAA 2400 
Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu Pro Pro Lys Lys 
35 785 790 795 800 

GGG CTG CTC CAG AGA CTC TTC AAG CGG CAG CAT CAG AAC AAT TCC AAG 244 8 

Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin Asn Asn Ser Lys 
805 810 815 

40 

AGT TCG CCC AGC TCC AAG ACC AGT TTT AAC CAC CAC ATA AAC TCA AAC 24 96 

Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His He Asn Ser Asn 
820 825 830 

45 CAT GTC AGC TCG AAC TCC ACC GGA AGC AGC TAG 2529 
His Val Ser Ser Asn Ser Thr Gly Ser Ser 
835 840 

50 (2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



79 



10 



20 



30 



50 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

10 

val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

Glu Gly Glu Gly Asp Ala Thr ^ G^y Lys Leu Thr Leu Lys Phe He 

Cys Thr Thr Gly Lys Leu Pro "l Pro Trp Pro Thr "u Val Thr Thr 
•I c 55 

15 Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg ry. Pro Asp His Met Lys 

Gin His ASP Phe Phe Lys Ser Ala Met Pro cL Gly Tyr Val Gin II 

90 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg aL Glu 

val Lys Phe Glu Gly Asp Thr Leu HI Asn Arg He Glu Lys Gly 

_ _ 12 0 1 pc; 

lie ASP Phe Lys Glu Asp Gly Asn Xle Leu Gly His Lys Leu Glu Tyr 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Jsp Lys Gin Lys Asn 

Gly He Lys val Asn Phe Lys He Arg His He Glu Asp Gly III 

170 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly L'p Gly 
Pro val Leu Leu Pro Asp Asn His "r Leu Ser Thr Gin Ser Ala Leu 



ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 



35 val Thr Ala Ala Gly xle Thr Leu Gly Met Asp III Leu Tyr Lys Ser 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ser Met Glu Leu ll^ 
Asn He Val Ala Asn Thr Val Leu Leu Lys Ala Arg Glu Gly g" Gly 
Gly Lys Arg Lys Gly Lys Ser Lys l" Trp Lys Glu He 12 Lys Phe 
pro His He Ser Gin Cys Glu As'p Leu Arg Arg Thr h" Asp Arg Asp 
45 Tyr Cys Ser Leu Cys Asp Lyf Gin Pro He Gly Arg" Leu Leu Phe Arg 



315 



om Phe cys Glu Thr Arg Pro Gly Leu Glu Cys^ Tyr He Gin Phe IZ 
ASP ser val Ala Glu Tyr Glu Val Thr Pro Asp Glu Lys Leu Oly Glu 
I^ys Gly Lys Glu He Met Thr Lys JJr Leu Thr Pro Lys Ser Pro Val 

360 3 R 

Phe He Ala Gin Val Gly Gin Asp Leu Val Ser Gin Thr Glu Glu Lys 
I.eu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser a" Cys Ala Gin Ser 

400 
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Val 


His 


Glu 


Tyr 


Leu 


Arg 


Gly Glu 


Pro 


Phe 


His 


Glu 


Tyr 


Leu 


Asp 


Ser 












405 










410 










415 






Met 


Phe 


Phe 


Asp 


Arg 


Phe 


Leu 


Gin 


Trp 


Lys 


Trp 


Leu 


Glu 


Arg 


Gin 


Pro 










42 0 










425 










43 0 






5 


Val 


Thr 


Lys 


Asn 


Thr 


Phe 


Arg 


Gin 


Tyr 


Arg 


Val 


Leu 


Gly 


Lys 


Gly 


Gly 








435 










440 










445 










Phe 


Gly 


Glu 


Val 


Cys 


Ala 


Cys 


Gin 


Val 


Arg 


Ala 


Thr 


Gly 


Lys 


Met 


Tyr 






450 










455 










460 












Ala 


Cys 


Lys 


Arg 


Leu 


Glu 


Lys 


Lys 


Arg 


He 


Lys 


Lys 


Arg 


Lys 


Gly 


Glu 


1 u 












470 










Aid 
4 / b 










A Qf\ 
4 a U 




Ser 


Met 


Ala 


Leu 


Asn 


Glu 


Lys 


Gin 


He 


Leu 


Glu 


Lys 


Val 


Asn 


Ser 


Gin 












485 










490 
















Phe 


Val 


Val 


Asn 


Leu 


Ala 


Tyr 


Ala 


Tyr 


Glu 


Thr 


Lys 


Asp 


Ala 


Leu 


Cys 










500 










505 










510 






15 


Leu 


Val 


Leu 


Thr 


He 


Met 


Asn 


Gly 


Gly Asp 


Leu 


Lys 


Phe 


His 


He 


Tyr 








515 










520 










525 










Asn 


Met 


Gly 


Asn 


Pro Gly 


Phe 


Glu 


Glu 


Glu 


Arg 


Ala 


Leu 


Phe 


Tyr 


Ala 






530 










535 










540 












Ala 


Glu 


lie 


Leu 


Cys 


Gly 


Leu Glu Asp 


Leu 


His 


Arg 


Glu 


Asn 


Thr 


Val 


20 


545 










550 










555 










560 




Tyr 


Arg 


Asp 


Leu 


Lys 


Pro 


Glu 


Asn 


He 


Leu 


Leu 


Asp 


Asp 


Tyr 


Gly 


His 












565 










570 










575 






lie 


Arg 


lie 


Ser 


Asp 


Leu 


Gly 


Leu 


Ala 


Val 


Lys 


He 


Pro 


Glu 


Gly 


Asp 










580 










585 










590 






25 


Leu 


lie 


Arg 


Gly 


Arg 


Val 


Gly 


Thr 


Val 


Gly 


Tyr 


Met 


Ala 


Pro 


Glu 


Val 








595 










600 










605 










Leu 


Asn 


Asn 


Gin 


Arg 


Tyr 


Gly 


Leu 


Ser 


Pro 


Asp 


Tyr 


Trp 


Gly 


Leu 


Gly 






610 










615 










620 












Cys 


Leu 


lie 


Tyr 


Glu 


Met 


He 


Glu 


Gly Gin 


Ser 


Pro 


Phe 


Arg 


Gly 


Arg 


30 


625 










630 










635 










640 




Lys 


Glu 


Lys 


Val 


Lys 


Arg 


Glu 


Glu 


Val 


Asp 


Arg 


Arg 


Val 


Leu 


Glu 


Thr 












645 










650 










655 






Glu 


Glu 


Val 


Tyr 


Ser 


His 


Lys 


Phe 


Ser 


Glu 


Glu 


Ala 


Lys 


Ser 


He 


Cys 










660 










665 










670 






35 


Lys 


Met 


Leu 


Leu 


Thr 


Lys 


Asp 


Ala 


Lys 


Gin 


Arg 


Leu 


Gly 


Cys 


Gin 


Glu 








675 










680 










685 










Glu 


Gly Ala 


Ala 


Glu 


Val 


Lys 


Arg 


His 


Pro 


Phe 


Phe 


Arg 


Asn 


Met 


Asn 






690 










695 










700 












Phe 


Lys 


Arg 


Leu 


Glu 


Ala 


Gly Met 


Leu 


Asp 


Pro 


Pro 


Phe 


Val 


Pro 


Asp 


40 


705 










710 










715 










720 




Pro 


Arg 


Ala 


Val 


Tyr 


Cys 


Lys 


Asp 


Val 


Leu 


Asp 


He 


Glu 


Gin 


Phe 


Ser 












725 










730 










735 






Thr 


Val 


Lys 


Gly 


Val 


Asn 


Leu 


Asp 


His 


Thr 


Asp 


Asp 


Asp 


Phe 


Tyr 


Ser 










740 










745 










750 






45 


Lys 


Phe 


Ser 


Thr 


Gly 


Ser 


Val 


Ser 


He 


Pro 


Trp 


Gin 


Asn 


Glu 


Met 


He 








755 










760 










765 










Glu 


Thr 


Glu 


Cys 


Phe 


Lys 


Glu 


Leu 


Asn 


Val 


Phe 


Gly 


Pro 


Asn 


Gly 


Thr 






770 










775 










780 












Leu 


Pro 


Pro 


Asp 


Leu 


Asn 


Arg Asn 


His 


Pro 


Pro 


Glu 


Pro 


Pro 


Lys 


Lys 


50 


785 










790 










795 










800 




Gly 


Leu 


Leu 


Gin 


Arg 


Leu 


Phe 


Lys 


Arg 


Gin 


His 


Gin 


Asn 


Asn 


Ser 


Lys 












805 










810 










815 






Ser 


Ser 


Pro 


Ser 


Ser 


Lys 


Thr 


Ser 


Phe 


Asn 


His 


His 


He 


Asn 


Ser 


Asn 










820 










825 










830 






55 


His 


Val 


Ser 


Ser 


Asn 


Ser 


Thr Gly 


Ser 


Ser 




















835 










840 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1899 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Va? Pro Til lit 
^ 10 



15 



vll G?u i2 nf ^^'^ ^^'^ ^^<= GTG TCC GGC 

val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Sy 

25 30 

Su ''•^'^ ''^^ ^-^^ CTG AAG TTC ATC 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Sle ill 



40 



45 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG arf Ann 
cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro ^h'r l": vI? ^Sr JSr 



55 60 



s z s s I- s s s z ^. ^ 

70 75 



80 



sj: s= ^ - - s - s s 51? o^?^ 



90 



95 



?hr Tit III III r f " r ™^ ^'^^ 

g Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
"° 105 110 

^l 7^1 G^u G?v f ' 
al Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 

S Z T^. ^. z s= - - - -J s= - 

135 

^ S Z S S - ^ 



48 
96 
144 
192 
240 
288 
336 
384 
432 
480 

82 
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145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAG GGC AGO 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

15 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
20 Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CGA GCC ATC ATG AGC AGA AGC AAG CGT GAC 76 8 

Gly Leu Arg Ser Arg Ala Arg Ala He Met Ser Arg Ser Lys Arg Asp 
25 245 250 255 

AAC AAT TTT TAT AGT GTA GAG ATT GGA GAT TCT ACA TTC AC A GTC CTG 816 
Asn Asn Phe Tyr Ser Val Glu He Gly Asp Ser Thr Phe Thr Val Leu 
260 265 270 

30 

AAA CGA TAT CAG AAT TTA AAA CCT ATA GGC TCA GGA GCT CAA GGA ATA 864 
Lys Arg Tyr Gin Asn Leu Lys Pro He Gly Ser Gly Ala Gin Gly He 
275 280 285 

35 GTA TGC GCA GCT TAT GAT GCC ATT CTT GAA AGA AAT GTT GCA ATC AAG 912 
Val Cys Ala Ala Tyr Asp Ala He Leu Glu Arg Asn Val Ala He Lys 
290 295 300 

AAG CTA AGC CGA CCA TTT CAG AAT CAG ACT CAT GCC AAG CGG GCC TAC 960 
40 Lys Leu Ser Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr 
305 310 315 320 

AGA GAG CTA GTT CTT ATG AAA TGT GTT AAT CAC AAA AAT ATA ATT GGC 100 8 

Arg Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn He He Gly 
45 325 330 335 

CTT TTG AAT GTT TTC ACA CCA CAG AAA TCC CTA GAA GAA TTT CAA GAT 1056 

Leu Leu Asn Val Phe Thr Pro Gin Lys Ser Leu Glu Glu Phe Gin Asp 
340 345 350 

50 

GTT TAC ATA GTC ATG GAG CTC ATG GAT GCA AAT CTT TGC CAA GTG ATT 1104 

Val Tyr He Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He 
355 360 365 

55 CAG ATG GAG CTA GAT CAT GAA AGA ATG TCC TAC CTT CTC TAT CAG ATG 1152 
Gin Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met 
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370 



380 



Zl Tiy xll 

385 ^ ^ ''^^ "-^^ He He His Arg Asp 

395 400 

TTA AAG CCC AGT AAT ATA GTA OTA AAA TCT GAT TGC ACT TTG AAG ATT 
Leu ^y. Pro Ser Asn He Val Val l.ys Ser Asp Cys ^hr llu x7. 



410 



415 



CTT GAC TTC GGT CTG GCC AGG ACT GCA GGA ACG AGT TTT ATG ATG ACG 
Leu ASP Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser pJI Set 
"0 425 

Pro tli f ? ■'''^ ^ ATC CTT GGC 

Pro Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly 



440 



445 



MI^ "^"^ °™ "A TCT GTG GGG TGC ATT ATG 

Met Gly Tyr Lys Glu Acn Val Asp Leu Trp Ser Val Gly c^s il^ 

460 

GGA GAA ATG GTT TGC CAC AAA ATC CTC TTT CCA GGA AGG GAC TAT ATT 
Gly Glu Met Val Cys His Lys He-Leu Phe Pro Gly Arg Asp ^r hI 

475 480 

GAT CAG TGG AAT AAA GTT ATT GAA CAG CTT GGA ACA CCA TGT CCT GAA 
ASP Gin Trp Asn Lys Val He Glu Gin Leu Gly Thr Pro Cyl p" 1^ 
'^^^ 490 495 

TTC ATG AAG AAA CTG CAA CCA ACA GTA AGG ACT TAC GTT GAA AAC AGA 
Phe Met Lys Lys Leu Gin Pro Thr Val Arg Thr Tyr Val Glu ^g 

505 

CCT AAA TAT GCT GGA TAT AGC TTT GAG AAA CTC TTC CCT GAT GTC CTT 
Pro Lys Tyr Ala Gly Tyr Ser Phe Glu Lys Leu Phe tro Asp vll Zl 

520 

TTC CCA GCT GAC TCA GAA CAC AAC AAA CTT AAA GCC AGT CAG GCA AGG 
Phe Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Oln aS ^ 

540 

Aso T"" GCA TCT AAA AGG ATC TCT 

ASP Leu Leu ser Lys Met Leu Val He Asp Ala Ser Lys Arg He s" 

"° 555 560 

vll Zl rf 17 '^^'^ '''''' ™^ *TC AAT GTC TGG TAT GAT CCT 

val ASP Glu Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro 

570 5„ 

IT. ^. s - s ^, tr. zi ir. ™ 



585 



590 



1200 

1248 

1296 

1344 

1392 

1440 

1488 

1536 

1584 

1632 

1680 

1728 

1776 

1824 

84 
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595 600 605 

GTT ATG GAC TTG GAG GAG AGA ACC AAG AAT GGA GTT ATA CGG GGG CAG 1872 
Val Met Asp Leu Glu Glu Arg Thr Lys Asn Gly Val lie Arg Gly Gin 
610 615 620 

CCC TCT CCT TTA GCA CAG GTG CAG CAG TGA 1902 
Pro Ser Pro Leu Ala Gin Val Gin Gin 
625 630 

(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 633 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



25 


Met 
1 


Val 


Ser 


Lys 


Gly 
5 


Glu 


Glu 


Leu 


Phe 


Thr 
10 


Gly 


Val 


Val 


Pro 


He 
15 


Leu 




Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser Gly 










20 










25 










30 








Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


30 






35 










40 










45 










Cys 


Thr 
50 


Thr 


Gly 


Lys 


Leu 


Pro 
55 


Val 


Pro 


Trp 


Pro 


Thr 
60 


Leu 


Val 


Thr 


Thr 




Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 




65 










70 










75 










80 


35 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly Tyr 


val 


Gin 


Glu 












85 










90 










95 






Arg 


Thr 


He 


Phe 
100 


Phe 


Lys 


Asp 


Asp 


Gly 
105 


Asn 


Tyr 


Lys 


Thr 


Arg 
110 


Ala 


Glu 




Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 


40 






115 










120 










125 










He 


Asp 
130 


Phe 


Lys 


Glu 


Asp 


Gly 
135 


Asn 


He 


Leu 


Gly 


His 
140 


Lys 


Leu 


Glu 


Tyr 




Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 




145 










150 










155 










160 


45 


Gly 


He 


Lys 


Val 


Asn 
165 


Phe 


Lys 


He 


Arg 


His 
170 


Asn 


He 


Glu 


Asp 


Gly 
175 


Ser 




Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly Asp Gly 










180 










185 










190 








Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 


50 






195 










200 










205 










Ser 


Lys 
210 


Asp 


Pro 


Asn 


Glu 


Lys 
215 


Arg 


Asp 


His 


Met 


Val 
220 


Leu 


Leu 


Glu 


Phe 




Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 




225 










230 










235 










240 


55 


Gly 


Leu 


Arg 


Ser 


Arg 
245 


Ala 


Arg 


Ala 


He 


Met 
250 


Ser 


Arg 


Ser 


Lys 


Arg 
255 


Asp 
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Asn Asn Phe Tyr Ser Val Glu He Gly Asp ser Thr Phe Thr Val Leu 

Lys Arg Tyr Gin Asn Leu Lys Pro lie Gly ser Gly Ala lln Gly He 

5 val cys Ala Ala Tyr Asp Ala III I,eu Glu Arg Asn III Ala He Lys 

Lys Leu ser Arg Pro Phe Gin Asn Gin Thr His "a Lys Arg Ala Tyr 

Arg Glu Leu Val Leu Met Lys Cys Val Asn Hx" Lys Asn He He G^y 

Leu Leu Asn Val Phe Thr Pro Gin Lys Ser Leu Glu Glu Phe Gin Asp 

350 



30 



40 



50 



val Tyr lie Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He 

360 

Gin Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met 
I-eu cys Gly He Lys His Leu His Ser Ala Gly lie He His Arg Asp 



20 """^ ""^^ 



375 

Leu His Ser Ala Gly 
390 ->Qc 

Lys 

Leu ASP Phe Gly Leu Ala Arg Thr Ala Gl" Thr Ser Phe Met Met Thr 
Pro Tyr Val Val Thr Arg Tyr Tyr l^g Ala Pro Glu Val He Leu Gly 
25 Met G3v Tv-r T.^ro m,, TV — Val ~ ~ 

4S5 460 



Mat Gly Tyr Lys Glu Asn Val Asp Leu Trp Ser Val Gly Cys He Met 

455 

Gly Glu Met val Cys His Lys He Leu Phe Pro Gly Arg Asp Tyr He 
Asp Gin Trp Asn Lys Val He Glu Gin Leu g" Thr Pro Cys Pro X 

w 490 4Qb; 

Phe Met Lys Lys Leu Gin Pro Thr Val Arg Thr Tyr Val Glu Asn Arg 



500 qne; 

505 



Pro Lys Tyr Ala Gly Tyr Ser Phe Glu Lys Leu Phe Pro Asp Val Leu 

35 Phe Pro Ala Asp Ser Glu His IZ Lys Leu Lys Ala Ser Gin Ala Arg 

ASP Leu Leu ser Lys Met Leu Val He Asp Ala III Lys Arg He Ser 
"° 555 



val ASP Glu Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro 

570 

ser Glu Ala Glu Ala Pro Pro Pro Lys He Pro Asp Lys Gin Le'u Asp 

585 Ron 
Glu Arg Glu His Thr He Glu Glu Trp Lys Glu Leu He "r Lys Glu 

yic 600 

val Met ASP Leu Glu Glu Arg Thr Lys Asn Gly Val He Arg Gly Gin 



615 g20 
Pro Ser Pro Leu Ala Gin Val Gin Gin 

(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1824 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1821 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

15 GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

20 Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
25 50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

30 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

35 CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 33 6 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 

40 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
45 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

50 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

55 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
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CCC GTG CTG CTG CCC GAC AAC CAC TAC CT6 AGC ACC CAG TCC GCC CTG 
Pro val ^eu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin str Sa 

-»- O o n rt 



200 205 



AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATO GTC CTG CTG GAG TTC 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu G^u Phe 



220 



GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 
Va Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Ser 
230 235 



240 



GGA CTC AGA TCT CGA GGG AAA ATG TCT CAG GAG AGG CCC ACG TTC TAC 
Gly Leu Arg Ser Arg Gly Lys Met Ser Gin Glu Arg Pro Thr Se 

250 



255 



CGG CAG GAG CTG AAC AAG ACA ATC TGG GAG GTG CCC GAG CGT TAC CAG 

Arg Gin Glu Leu Asn Lys Thr lie Trn ri,i v^i t. 

260 ° 

265 270 

AAC CTG TCT CCA GTG GGC TCT GGC GCC TAT GGC TCT GTO TGT GCT GCT 
Asn Leu ser Pro Val Gly Ser Gly Ala Tyr Gly Ser Val Cys pll S 

280 285 

Ph! T"" ™ '^'^ GCA GTG AAG AAG CTC TCC AGA 

Phe ASP Thr Lys Thr Gly Leu Arg Val Ala Val Lys Lys Leu Ser Arg 

pS ^ G^n S^r ^ ^'^^ ^TG CGG 

Pro Phe Gin Ser He lie Hxs Ala Lys Arg Thr Tyr Arg Glu Leu Arg 

315 

TTA CTT AAA CAT ATG AAA CAT GAA AAT GTG ATT GGT CTG TTG GAC GTT 
Leu Leu Lys H.s Met Lys His Glu Asn Val He Gly Leu lS A^p 
325 330 335 

TTT ACA CCT GCA AGG TCT CTG GAG GAA TTC AAT GAT GTG TAT CTG GTG 
Phe Thr Pro Ala Arg Ser Leu Glu Glu Phe Asn Asp vll lyl vlt 

345 

360 365 

CTT ACA GAT GAC CAT GTT CAG TTC CTT ATC TAC CAA ATT CTC CGA GGT 
Leu Thr ASP ASP His Val Gin Phe Leu lie Tyr Gin hI "u A^ Gly 

^'^^ 380 

jjs Sr He ufs iTr r r t ^ 

Lys Tyr He His Ser Ala Asp He He His Arg Asp Leu Lys Pro 

AGT AAT CTA GCT GTG AAT GAA GAC TGT GAG CTG AAG ATT CTG GAT TTT 
ser Asn Leu Ala Val Asn Glu Asp Cys Glu Leu Lys He Su Asp pJI 



624 
672 
720 
768 
816 
864 
912 
960 
1008 
1056 
1104 
1152 
1200 
1248 
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405 410 415 

GGA CTG GCT CGG CAC ACA GAT GAT GAA ATG ACA GGC TAG GTG GCC ACT 1296 
Gly Leu Ala Arg His Thr Asp Asp Glu Met Thr Gly Tyr Val Ala Thr 
420 425 430 

AGG TGG TAG AGG GCT CCT GAG ATC ATG CTG AAC TGG ATG CAT TAG AAC 1344 
Arg Trp Tyr Arg Ala Pro Glu lie Met Leu Asn Trp Met His Tyr Asn 
435 440 445 

CAG ACA GTT GAT ATT TGG TCA GTG GGA TGC ATA ATG GCC GAG CTG TTG 13 92 

Gin Thr Val Asp lie Trp Ser Val Gly Cys He Met Ala Glu Leu Leu 
450 455 460 

15 ACT GGA AGA ACA TTG TTT CCT GGT ACA GAC CAT ATT GAT CAG TTG AAG 144 0 

Thr Gly Arg Thr Leu Phe Pro Gly Thr Asp His He Asp Gin Leu Lys 
465 470 475 480 

CTC ATT TTA AGA CTC GTT GGA ACC CCA GGG GCT GAG CTT TTG AAG AAA 1488 
20 Leu He Leu Arg Leu Val Gly Thr Pro Gly Ala Glu Leu Leu Lys Lys 

485 490 495 

ATC TCC TCA GAG TCT GCA AGA AAC TAT ATT CAG TCT TTG ACT CAG ATG 153 6 

He Ser Ser Glu Ser Ala Arg Asn Tyr He Gin Ser Leu Thr Gin Met 
25 500 505 510 

COG AAG ATG AAC TTT GCG AAT GTA TTT ATT GGT GCC AAT CCC CTG GCT 1584 

Pro Lys Met Asn Phe Ala Asn Val Phe He Gly Ala Asn Pro Leu Ala 

515 520 525 

30 

GTC GAC TTG CTG GAG AAG ATG CTT GTA TTG GAC TCA GAT AAG AGA ATT 1632 

Val Asp Leu Leu Glu Lys Met Leu Val Leu Asp Ser Asp Lys Arg He 

530 535 540 

35 ACA GCG GCC CAA GCC CTT GCA CAT GCC TAG TTT GCT CAG TAC CAC GAT 1680 
Thr Ala Ala Gin Ala Leu Ala His Ala Tyr Phe Ala Gin Tyr His Asp 
545 550 ■ 555 560 

CCT GAT GAT GAA CCA GTG GCC GAT CCT TAT GAT CAG TCC TTT GAA AGC 1728 
40 Pro Asp Asp Glu Pro Val Ala Asp Pro Tyr Asp Gin Ser Phe Glu Ser 

565 570 575 

AGG GAC CTC CTT ATA GAT GAG TGG AAA AGC CTG ACC TAT GAT GAA GTC 1776 
Arg Asp Leu Leu He Asp Glu Trp Lys Ser Leu Thr Tyr Asp Glu Val 
45 580 585 590 

ATC AGC TTT GTG CCA CCA CCC CTT GAC CAA GAA GAG ATG GAG TCC TGA 1824 
He Ser Phe Val Pro Pro Pro Leu Asp Gin Glu Glu Met Glu Ser 
595 600 605 

50 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

10 I c 

val Glu Leu ASP Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
Glu Gly Glu Gly Asp Ala Thr Tyr "y Lys Leu Thr Leu Lys Phe He 

Cys Thr Thr Gly Lys Leu Pro ill Pro Trp Pro Thr IL Val Thr Thr 

60 

jeu Thr ^r Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

Gin His ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin gL 

90 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
val Lys Phe Glu Gly Asp Thr Leu III Asn Arg He Glu 111 Lys Gly 
lie ASP Phe Lys Glu Asp Gly A^^ He Leu Gly His lys Leu Glu Tyr 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

155 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

val Gin Leu Ala Asp His Tyr Gin Gin 111 Thr Pro He Gly ^p Gly 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

val Thr Ala Ala Gly lie T^r Leu Gly Met Asp 111 Leu Tyr Lys Ser 

Gly Leu Arg Ser Arg Gly Lys Met Ser Gin "u Arg Pro Thr Phe ^yr 

Arg Gin Glu Leu Asn Lys Thr He Trp Glu Val Pro Glu Arg l^l Gin 

^oO 265 
Asn Leu ser Pro Val Gly Ser Gly Ala Tyr Gly Ser Val Cys Ala Ala 

Phe ASP Thr Lys Thr Gly Leu Arg Val Ala Val Lys Lys Leu Ser Arg 

300 

Pro Phe Gin Ser He He His Ala Lys Arg Thr Tyr Arg Glu Leu Arg 

315 

Leu Leu Lys His Met Lys His Glu Asn Val He Gly Leu Leu Asp III 

Phe Thr Pro Ala Arg Ser Leu Glu Glu Phe Asn Asp Val Tyr Leu Val 

Thr His Leu Met Gly Ala Asp Leu III Asn He Val Lys Cys Gin Lys 

Leu Thr ASP ASP His Val Gin 111 x.eu He Tyr Gin He Leu Arg Gly 
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Leu 


Lys 


Tyr 


He 


His 


Ser 


Ala 


Asp 


He 


lie 


His 


Arg 


Asp 


Leu 


Lys 


Pro 




385 










390 










395 










400 




Ser 


Asn 


Leu 


Ala 


vaj. 
405 


Asn 


Glu 


Asp 


Cys 


Glu 

410 


Leu 


Lys 


lie 


Leu 


Asp 
415 


Phe 


c 
D 


Gly 


Leu 


Ala 


Arg 

42 0 


His 


Thr 


Asp 


Asp 


Glu 

/IOC 


Met 


Thr 


Gly 


Tyr 


Val 
4 30 


Ala 


Thr 




Arg 


Trp 


Tyr 

i1 c 
4 J D 


Arg 


A 1 -1 

A J. a 


Pro 


Glu 


He 
440 


Met 


Leu 


Asn 


Trp 


Met 
44 5 


Hi s 


Tyr 


Asn 




Gin 


Thr 


Val 


Asp 


lie 


Trp 


Ser 


Val 


Gly 


Cys 


He 


Met 


TV T 

Ala 


Glu 


Leu 


Leu 


in 




450 










455 










460 












Thr 


Gly 


Arg 


Thr 


Leu 


Pne 


Pro Gly 


Thr 


Asp 


His 


He 


Asp 


Gin 


Leu 


Lys 




465 










470 










475 










480 




Leu 


He 


Leu 


Arg 


Leu 
485 


Val 


Gly 


Thr 


Pro 


Gly 
490 


Ala 


Glu 


Leu 


Leu 


Lys 
495 


Lys 


It) 


He 


Ser 


Ser 


Glu 
500 


Ser 


Ala 


Arg 


Asn 


Tyr 
505 


He 


Gin 


Ser 


Leu 


Thr 
510 


Gin 


Met 




Pro 


Lys 


Met 


Asn 


Phe 


Ala 


Asn 


Val 


Phe 


He 


Gly Ala 


Asn 


Pro 


Leu 


Ala 








515 










520 










525 










Val 


Asp 


Leu 


Leu 


Glu 


Lys 


Met 


Leu 


Val 


Leu 


Asp 


Ser 


Asp 


Lys 


Arg 


He 


20 




530 










535 










540 












Thr 


Ala 


Ala 


Gin 


Ala 


Leu 


Ala 


His 


Ala 


Tyr 


Phe 


Ala 


Gin 


Tyr 


His 


Asp 




545 










550 










555 










560 




Pro 


Asp 


Asp 


Glu 


Pro 
565 


Val 


Ala 


Asp 


Pro 


Tyr 
570 


Asp 


Gin 


Ser 


Phe 


Glu 
575 


Ser 


25 


Arg 


Asp 


Leu 


Leu 
580 


He 


Asp 


Glu 


Trp 


Lys 
585 


Ser 


Leu 


Thr 


Tyr 


Asp 
590 


Glu 


Val 




He 


Ser 


Phe 
595 


Val 


Pro 


Pro 


Pro 


Leu 
600 


Asp 


Gin 


Glu 


Glu 


Met 
605 


Glu 


Ser 





30 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2907 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

40 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2904 
(D) OTHER INFORMATION: 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4 8: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 ' 15 

50 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

55 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
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92 

45 



TC3C ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTC ACC ACC 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Vl rlr Jhr 

55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGO CGC TAC CCC GAC CAC ATG AAG 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 
Gin Has ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr V.l 111 oil 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

105 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Sy 

120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu ^r 
"° 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys ^l 

155 160 

tT^ r° "^"^ ''^^ ^^'^ ^-^^ GAG GAC GGC AGC 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

vll G?n III l^"" f '^'^^ ^'^^ ATC GGC GAC GGC 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 
Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

ill ^ l"""" '^'^ ^AC ATG GTC CTG CTG GAG TTC 

ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

vll ?hr A^ l^"" ''T'' '^'^ °AG CTG TAC AAG TCC 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
" 230 235 240 



GGA CTC AGA TCT ATG AGT GCT GAG GGG TAC CAG TAC AGA GCG CTG TAT 
Gly Leu Arg Ser Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala lIu 



2" 250 255 



GAT TAT AAA AAG GAA AGA GAA GAA GAT ATT GAC TTG CAC TTG GGT GAC 
ASP Tyr Lys Lys Glu Arg Glu Glu Asp He Asp Leu His lIu Ty Sp 



192 

240 

288 

336 

384 

432 

480 

528 

576 

624 

672 

720 

768 

816 
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260 



265 



270 



ATA TTG ACT GTG AAT AAA GGG TCC TTA GTA GCT CTT GGA TTC AGT GAT 
lie Leu Thr Val Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp 

275 280 285 



864 



10^ 



GGA CAG GAA 
Gly Gin Glu 
290 

GAA ACC ACA 
Glu Thr Thr 
305 



GCC AGG 
Ala Arg 



GGG GAA 
Gly Glu 



CCT GAA 
Pro Glu 
295 

AGG GGG 
Arg Gly 
310 



GAA ATT GGC TGG 
Glu lie Gly Trp 



GAC TTT CCG GGA 
Asp Phe Pro Gly 
315 



TTA AAT 
Leu Asn 
300 

ACT TAC 
Thr Tyr 



GGC TAT AAT 
Gly Tyr Asn 



GTA GAA TAT 
Val Glu Tyr 
320 



912 



960 



15 ATT GGA AGG AAA AAA ATC TCG CCT CCC ACA CCA AAG CCC CGG CCA CCT 
lie Gly Arg Lys Lys lie Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro 
325 330 335 



1008 



CGG CCT CTT CCT GTT GCA CCA GGT TCT TCG AAA ACT GAA GCA GAT GTT 
20 Arg Pro Leu Pro Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val 
340 345 350 



1056 



25 



GAA CAA CAA GCT TTG ACT CTC CCG GAT CTT GCA GAG CAG TTT GCC CCT 
Glu Gin Gin Ala Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro 
355 360 365 



1104 



30 



CCT GAC ATT 
Pro Asp lie 
370 

AAG AAA GGT 
Lys Lys Gly 
385 



GCC CCG 
Ala Pro 



CTG GAA 
Leu Glu 



CCT CTT 
Pro Leu 
375 

TGT TCA 
Cys Ser 
390 



CTT ATC AAG CTC 
Leu lie Lys Leu 



ACT CTA TAC AGA 
Thr Leu Tyr Arg 
395 



GTG GAA 
Val Glu 
380 

ACA CAG 
Thr Gin 



GCC ATT GAA 
Ala lie Glu 



AGC TCC AGC 
Ser Ser Ser 
400 



1152 



1200 



35 AAC CTG GCA GAA TTA CGA CAG CTT CTT GAT TGT GAT ACA CCC TCC GTG 
Asn Leu Ala Glu Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val 
405 410 415 



1248 



GAC TTG GAA ATG ATC GAT GTG CAC GTT TTG GCT GAC GCT TTC AAA CGC 
40 Asp Leu Glu Met He Asp Val His Val Leu Ala Asp Ala Phe Lys Arg 
420 425 430 



1296 



45 



TAT CTC CTG 
Tyr Leu Leu 
435 



GAC TTA CCA AAT CCT GTC ATT CCA GCA GCC 
Asp Leu Pro Asn Pro Val He Pro Ala Ala 
440 445 



GTT TAC AGT 
Val Tyr Ser 



1344 



50 



GAA ATG ATT 
Glu Met He 
450 

CAG CTA TTG 
Gin Leu Leu 
465 



TCT TTA 
Ser Leu 



AAG AAG 
Lys Lys 



GCT CCA 
Ala Pro 
455 

CTT ATT 
Leu He 
470 



GAA GTA CAA AGC 
Glu Val Gin Ser 



AGG TCG CCT AGC 
Arg Ser Pro Ser 
475 



TCC GAA 
Ser Glu 
460 

ATA CCT 
He Pro 



GAA TAT ATT 
Glu Tyr He 



CAT CAG TAT 
His Gin Tyr 
480 



1392 



1440 



55 TGG CTT ACG CTT CAG TAT TTG TTA AAA CAT TTC TTC AAG CTC TCT CAA 
Trp Leu Thr Leu Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin 



1488 



93 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/0014S 



485 



94 

495 



ACC TCC AGC AAA AAT CTG TTG AAT GCA AGA GTA CTC TCT GAA ATT TTC 
Thr Ser Ser Lys Asn I.eu Leu Asn Ala Arg Val Leu Ser S 
"° 505 

AGC CCT ATG CTT TTC AGA TTC TCA GCA GCC AGC TCT GAT AAT ACT GAA 
ser pro Met Leu Phe Arg Phe Ser Ala Ala Ser Ser A^p ^ 
515 520 525 

AAC CTC ATA AAA GTT ATA GAA ATT TTA ATC TCA ACT GAA TGG AAT GAA 
Asn Leu lie Lys Val lie Glu He Leu He Ser Thr Glu Trp ^ 

535 540 

CGA CAG CCT GCA CCA GCA CTG CCT CCT AAA CCA CCA AAA CCT ACT ACT 
Arg Gin Pro Ala Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro ThI Jhr 

val Ala Asn Asn Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu 
565 570 

S ^Vr ^ro mv '^'^ AAT GAA AAA CTT CGA 

Trp Tyr Trp Gly Asp He Ser Arg Glu Glu Val Asn Glu Lys Leu Arg 
580 585 590 

GAT ACA GCA GAC GGG ACC TTT TTG GTA CGA GAT GCG TCT ACT AAA ATC 
ASP Thr Ala ASP Gly Thr Phe Leu Val Arg Asp Ala Ser J" 
595 goo 



605 



CAT GGT GAT TAT ACT CTT ACA CTA AGG AAA GGG GGA AAT AAC AAA TTA 
Hxs Gly ASP TVr Thr Leu Thr Leu Arg Lys Gly Gly Asn IZ 



620 



ATC AAA ATA TTT CAT CGA GAT GGG AAA TAT GGC TTC TCT GAC CCA TTA 
lie Lys lie Phe His Arg Asp Gly Lys Tyr Gly Phe Ser Asp pS "J 

?hr Phe ser '^'^ '^"'^ ''^ ^^T 

Thr Phe ser Ser Val val Glu Leu He Asn His Tyr Arg Asn Glu Ser 

650 655 

17 ^ '''''' ™ GTG AAA TTA CTT TAT CCA GTA 

I-eu Ala Gin Tyr Asn Pro Lys Leu Asp Val Lys Leu Leu T^r Pro vll 

660 



665 



670 



TCC AAA TAG CAA CAG GAT CAA GTT GTC AAA GAA GAT AAT ATT GAA GCT 
ser Lys Tyr Gin Gin Asp Gin Val Val Lys Glu Asp Z He SI 
675 680 685 

vll G?V ^^r"" ™^ ^'^'^ ■'■^ °AA AAA AGT 

Val Gly Lys Lys Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser 

695 

Arg Glu Tyr Asp Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 

94 
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705 710 715 720 

ATC CAA ATG AAA AGG ACA GCT ATT GAA GCA TTT AAT GAA ACC ATA AAA 2208 
lie Gin Met Lys Arg Thr Ala He Glu Ala Phe Asn Glu Thr He Lys 
725 730 735 

ATA TTT GAA GAA CAG TGC CAG ACC CAA GAG CGG TAG AGC AAA GAA TAG 2256 
He Phe Glu Glu Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr 
740 745 750 

ATA GAA AAG TTT AAA CGT GAA GGC AAT GAG AAA GAA ATA CAA AGG ATT 2304 
He Glu Lys Phe Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He 
755 760 765 

15 ATG CAT AAT TAT GAT AAG TTG AAG TCT CGA ATC AGT GAA ATT ATT GAC 2352 
Met His Asn Tyr Asp Lys Leu Lys Ser Arg He Ser Glu He He Asp 
770 775 780 

AGT AGA AGA AGA TTG GAA GAA GAC TTG AAG AAG CAG GCA GCT GAG TAT 2400 
20 Ser Arg Arg Arg Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr 
785 790 795 800 

CGA GAA ATT GAC AAA CGT ATG AAC AGC ATT AAA CCA GAC CTT ATC CAG 2448 
Arg Glu He Asp Lys Arg Met Asn Ser He Lys Pro Asp Leu He Gin 
25 805 810 815 

CTG AGA AAG ACG AGA GAC CAA TAG TTG ATG TGG TTG ACT CAA AAA GGT 2496 

Leu Arg Lys Thr Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly 

820 625 830 

30 

GTT CGG CAA AAG AAG TTG AAC GAG TGG TTG GGC AAT GAA AAC ACT GAA 2544 

Val Arg Gin Lys Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu 

835 840 845 

35 GAC CAA TAT TCA CTG GTG GAA GAT GAT GAA GAT TTG CCC CAT CAT GAT 2592 
Asp Gin Tyr Ser Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp 
850 855 860 

GAG AAG ACA TGG AAT GTT GGA AGC AGC AAC CGA AAC AAA GCT GAA AAC 2640 
40 Glu Lys Thr Trp Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn 
865 870 875 880 

CTG TTG CGA GGG AAG CGA GAT GGC ACT TTT CTT GTC CGG GAG AGC AGT 2 688 
Leu Leu Arg Gly Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser 
45 885 890 895 

AAA CAG GGC TGC TAT GCC TGC TCT GTA GTG GTG GAC GGC GAA GTA AAG 2736 

Lys Gin Gly Cys Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys 
900 905 910 

50 

CAT TGT GTC ATA AAC AAA ACA GCA ACT GGC TAT GGC TTT GCC GAG CCC 2784 

His Cys Val He Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro 
915 920 925 

55 TAT AAC TTG TAG AGC TCT CTG AAA GAA CTG GTG CTA CAT TAC CAA CAC 2832 
Tyr Asn Leu Tyr Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His 
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96 

^30 935 

ACC TCC CTT GTG CAG CAC AAC GAC TCC CTC AAT GTC ACA CTA GCC TAG 
Thr ser l.eu Val Gin His Asn Asp Ser Leu Asn Val ThJ llu 111 

955 

CCA GTA TAT GCA CAG CAG AGG CGA TGA 

Pro val Tyr Ala Gin Gin Arg Arg ^^^'^ 
965 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49; 

Met val ser Lys Gly GIu Glu Leu Phe Thr Gly Val Val Pro He Leu 

10 ic 

val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

25 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

Cys Thr Thr Gly Lys Leu Pro til Pro Trp Pro Thr IL Val Thr Thr 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr pJo Asp His Met Lys 

Gin His ASP Phe Phe Lys Ser Ala Met Pro gL Gly Tyr Val Gin Glu 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg IL Glu 

105 

val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu ^^u Lys Gly 

120 125 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

Asn Tyr Asn Ser His Asn III Tyr He Met Ala l^p Lys Gin Lys Asn 

Gly He Lys Val Asn Phe Lys He Arg His III He Glu Asp Gly 

170 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly ^p Gly 

Pro val Leu Leu Pro Asp Asn His lyr Leu Ser Thr Gin Ser Ala Leu 

200 205 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

val Thr Ala Ala Gly lie HI Leu Gly Met Asp G^u Leu Tyr Lys Ser 

Gly Leu Arg Ser Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu ^r 
245 250 255 
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Asp 


Tyr 


Lys 


Lys 
260 


Glu 


Arg 


Glu 


Glu 


Asp 
265 


He 


Asp 


Leu 


His 


Leu 
270 


Gly 


Asp 




lie 


Leu 


Thr 


vax 


Asn 


Lys 


Gly 


Ser 


Leu 


Val 


Ala 


Leu Gly 


Phe 


Ser 


Asp 








275 










280 










285 








c 
O 


Gly 


Gin 
2 90 


Glu 


TV T — 

Ala 


Arg 


Pro 


Glu 
295 


Glu 


He 


Gly 


Trp 


Leu 
300 


Asn 


Gly 


Tyr 


Asn 




Glu 


Thr 


Thr 


Gly 


Glu 


Arg 


Gly 


Asp 


Phe 


Pro 


Gly 


Thr 


Tyr 


Val 


Glu 


Tyr 




3 05 










310 










315 










320 




He 


Gly 


Arg 


Lys 


Lys 


He 


Ser 


Pro 


Pro 


Thr 


Pro 


Lys 


Pro 


Arg 


Pro 


Pro 


10 










325 










330 










335 






Ajrg 


Pro 


Leu 


Pro 


val 


TV T _ 

Ala 


Pro 


Gly Ser 


Ser 


Lys 


Thr 


Glu 


Ala 


Asp 


Val 










340 










345 










350 








Glu 


GJ.n 


355 


Ala 


Leu 


Thr 


Leu 


Pro 
360 


Asp 


Leu 


Ala 


Glu 


Gin 
365 


Phe 


Ala 


Pro 




Pro 


Asp 
3 70 


lie 


Ala 


Pro 


Pro 


Leu 
375 


Leu 


He 


Lys 


Leu 


Val 
380 


Glu 


Ala 


He 


Glu 




Lys 


Lys 


Gly 


Leu 


Glu 


Cys 


Ser 


Thr 


Leu 


Tyr 


Arg 


Thr 


Gin 


Ser 


Ser 


Ser 




385 










390 










395 










400 




Asn 


Leu 


Ala 


Glu 


Leu 


Arg 


Gin 


Leu 


Leu 


Asp 


Cys 


Asp 


Thr 


Pro 


Ser 


Val 


20 










405 










410 










415 






Asp 


Leu 


Glu 


Met 
420 


He 


Asp 


Val 


His 


Val 
425 


Leu 


Ala 


Asp 


Ala 


Phe 
430 


Lys 


Arg 




Tyr 


Leu 


Leu 
435 


Asp 


Leu 


Pro 


Asn 


Pro 
440 


Val 


He 


Pro 


Ala 


Ala 
445 


Val 


Tyr 


Ser 


25 


Glu 


Met 
450 


He 


Ser 


Leu 


Ala 


Pro 
455 


Glu 


Val 


Gin 


Ser 


Ser 
460 


Glu 


Glu 


Tyr 


He 




Gin 


Leu 


Leu 


Lys 


Lys 


Leu 


He 


Arg 


Ser 


Pro 


Ser 


He 


Pro 


His 


Gin 


Tyr 




465 










470 










475 










480 




Trp 


Leu 


Thr 


Leu 


Gin 


Tyr 


Leu 


Leu 


Lys 


His 


Phe 


Phe 


Lys 


Leu 


Ser 


61n 


30 










485 










490 










4 95 






Thr 


Ser 


Ser 


Lys 
500 


Asn 


Leu 


Leu 


Asn 


Ala 
505 


Arg 


Val 


Leu 


Ser 


Glu 
510 


He 


Phe 




Ser 


Pro 


Met 
515 


Leu 


Phe 


Arg 


Phe 


Ser 
520 


Ala 


Ala 


Ser 


Ser 


Asp 
525 


Asn 


Thr 


Glu 




Asn 


Leu 
53 0 


He 


Lys 


Val 


He 


Glu 
535 


He 


Leu 


He 


Ser 


Thr 
540 


Glu 


Trp 


Asn 


Glu 




Arg 


Gin 


Pro 


Ala 


Pro 


Ala 


Leu 


Pro 


Pro 


Lys 


Pro 


Pro 


Lys 


Pro 


Thr 


Thr 




545 










550 










555 










560 




Val 


Ala 


Asn 


Asn 


Gly Met Asn 


Asn 


Asn 


Met 


Ser 


Leu 


Gin 


Asn 


Ala 


Glu 


40 










565 










570 










575 






Trp 


Tyr 


Trp 


Gly 
580 


Asp 


He 


Ser 


Arg 


Glu 
585 


Glu 


Val 


Asn 


Glu 


Lys 
590 


Leu 


Arg 




Asp 


Thr 


Ala 
595 


Asp 


Gly 


Thr 


Phe 


Leu 
600 


Val 


Arg 


Asp 


Ala 


Ser 
605 


Thr 


Lys 


Met 


45 


His 


Gly 


Asp 


Tyr 


Thr 


Leu 


Thr 


Leu 


Arg 


Lys 


Gly 


Gly Asn 


Asn 


Lys 


Leu 






610 










615 










620 












He 


Lys 


He 


Phe 


His 


Arg 


Asp 


Gly 


Lys 


Tyr 


Gly 


Phe 


Ser 


Asp 


Pro 


Leu 




625 










630 










635 










640 




Thr 


Phe 


Ser 


Ser 


Val 


Val 


Glu 


Leu 


He 


Asn 


His 


Tyr 


Arg 


Asn 


Glu 


Ser 


50 










645 










650 










655 






Leu 


Ala 


Gin 


Tyr 
660 


Asn 


Pro 


Lys 


Leu 


Asp 
665 


Val 


Lys 


Leu 


Leu 


Tyr 
670 


Pro 


Val 




Ser 


Lys 


Tyr 
675 


Gin 


Gin 


Asp 


Gin 


Val 
680 


Val 


Lys 


Glu 


Asp 


Asn 
685 


He 


Glu 


Ala 


55 


Val 


Gly 
690 


Lys 


Lys 


Leu 


His 


Glu 
695 


Tyr 


Asn 


Thr 


Gin 


Phe 
700 


Gin 


Glu 


Lys 


Ser 
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35 
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Arg Glu Tyr Asp Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu 

lie Gin Met Lys Arg Thr Ala He Glu Ala pJe Asn Glu Thr He Lys 

730 



He Phe Glu Glu Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr 

He Glu Lys Phe Lys Arg Glu Gly Isn Glu Lys Glu He III Arg He 

760 

Met His Asn Tyr Asp Lys Leu Lys Ser Arg He Ser Glu He He Asp 

"^"^^ 7B0 

ser Arg Arg Arg Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr 

805 810 



Arg Glu lie Asp Lys Arg Met Asn Ser He lys Pro Asp Leu He lln 

810 one 

15 Leu Arg Lys Thr Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly 

820 Q25 



Val Arg Gin Lys Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu 

840 345 
ASP Gin TVr ser Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp 

860 

Glu Lys Thr Trp Asn Val Gly ser Ser Asn Arg Asn Lys Ala Glu Asn 

Leu Leu Arg Gly Lys Arg Asp Gly Thr Phe lIu Val Arg Glu Ser ler 

25 Lys Gin Gly Cys Tyl Ala Cys Ser Val vll val Asp Gly Glu v" Lys 

905 910 
H.S cys val He Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro 

920 

Tyr Asn Leu Tyr Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His 
i^JO 935 



Thr ser Leu Val Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr 

950 955 _ _ 

Pro Val Tyr Ala Gin Gin Arg Arg 
965 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2157 
(D) OTHER INFORMATION: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

mI^ f '^'^ '''''' ''''^ ^GG GTG GTG CCC ATC CTG 

Met val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He lIu 



5 10 15 



48 
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GTC GAG CTG GAC GGC GAG GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

5 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 * 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
10 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
15 65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

20 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC- AAG ACC CGC GCC GAG 33 6 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

25 GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

30 lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC T^G CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
35 145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

40 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

45 CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
50 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
55 225 230 235 240 
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z: se" Ts i^i X r r '•^'^ ^^^^ -° 

u Arg ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ser Ser lie 



250 



255 



Zu pro sje So So ei? r r -° 

260 ^ T^^p i-ys 

265 270 

S SI ^^^^ S= S s S X X 

280 285 
300 

AAG AAA ACA GGA CGA TTA GAT GAG CTT GAG AAA GCC ATC ACC ACT CAA 
X.ys ^ys Thr Gly Arg .eu Asp Glu Leu Glu Lys Ala Xle 



315 



320 



330 

TGG GGA CTG AGT ACA CCA AAT ACG ATA GAT GAG TGG GAT ACA ACA GGC 
Trp Gly Leu Ser Thr Pro Asn Thr He Asp Gin Trp A^p THr oty 

350 

IT. tT. IT. s ^ z z z 

360 

GTA TCC CAT CGA AAA GGA TTG CCA CAT GTT ATA TAT TOC CGA TTA TGG 
Val ser H.s Arg Lys Gly Leu Pro His Val He lyr Cys Arg I™ 

^'^^ 380 

400 

Sr 'Z ir r °^ ^-^^ ™^ °™ 

y u Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val Cys Val Asn Pro 

410 

s= z ^ ^ ?s ?s ^ - -I - 

425 

So ?hr Su h! r °^ ^"-^ TAT 

Arg His Thr Glu He Leu Thr Glu Leu Pro Pro Leu Asp Asp Tyr 

440 

?S ^ S IT. S ^ ^ S S 

460 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 
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SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 PCT/DK98/001 45 

101 

CAG AGT AAT TAT ATT CCA GAA ACG CCA CCT CCT GGA TAT ATC AGT GAA 1440 

Gin Ser Asn Tyr lie Pro Glu Thr Pro Pro Pro Gly Tyr lie Ser Glu 
465 470 475 480 

5 GAT GGA GAA ACA AGT GAC CAA CAG TTG AAT CAA AGT ATG GAC ACA GGC 14 88 

Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser Met Asp Thr Gly 
485 490 495 

TCT CCA GCA GAA CTA TCT CCT ACT ACT CTT TCC CCT GTT AAT CAT AGC 1536 

10 Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro Val Asn His Ser 

500 505 510 

TTG GAT TTA CAG CCA GTT ACT TAG TCA GAA CCT GCA TTT TGG TGT TCA 1584 

Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala Phe Trp Cys Ser 
15 515 520 525 

ATA GCA TAT TAT GAA TTA AAT CAG AGG GTT GGA GAA ACC TTC CAT GCA 1632 

lie Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu Thr Phe His Ala 
530 535 540 

20 

TCA CAG CCC TCA CTC ACT GTA GAT GGC TTT ACA GAC CCA TCA AAT TCA 1680 

Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp Pro Ser Asn Ser 
545 550 555 560 

25 GAG AGG TTC TGC TTA GGT TTA CTC TCC AAT GTT AAC CGA AAT GCC ACG 1728 

Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn Arg Asn Ala Thr 
565 570 575 

GTA GAA ATG ACA AGA AGG CAT ATA GGA AGA GGA GTG CGC TTA TAC TAC 1776 

30 Val Glu Met Thr Arg Arg His lie Gly Arg Gly Val Arg Leu Tyr Tyr 

560 585 590 

ATA GGT GGG GAA GTT TTT GCT GAG TGC CTA AGT GAT AGT GCA ATC TTT 1824 

lie Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp Ser Ala lie Phe 
35 595 600 605 

GTG CAG AGC CCC AAT TGT AAT CAG AGA TAT GGC TGG CAC CCT GCA ACA 1872 

Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp His Pro Ala Thr 
610 615 620 

40 

GTG TGT AAA ATT CCA CCA GGC TGT AAT CTG AAG ATC TTC AAC AAC CAG 1920 

Val Cys Lys lie Pro Pro Gly Cys Asn Leu Lys lie Phe Asn Asn Gin 
625 630 635 640 

45 GAA TTT GCT GCT CTT CTG GCT CAG TCT GTT AAT CAG GGT TTT GAA GCC 1968 

Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin Gly Phe Glu Ala* 
645 650 655 

GTC TAT CAG CTA ACT AGA ATG TGC ACC ATA AGA ATG AGT TTT GTG AAA 2 016 

50 Val Tyr Gin Leu Thr Arg Met Cys Thr lie Arg Met Ser Phe Val Lys 

660 665 670 

GGG TGG GGA GCA GAA TAC CGA AGG CAG ACG GTA ACA AGT ACT CCT TGC 2 064 

Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Thr Ser Thr Pro Cys 
55 675 680 685 
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S III f " TTG GAC AAA GTA 

Trp lie Glu Leu Hxs Leu Asn Gly Pro Leu Gin Trp Leu Asp til 



700 



TTA ACT CAG ATG GGA TCC CCT TCA GTG CGT TGC TCA AGC ATG TCA TAA 
Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser Ser Zt sTr 



715 



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 amino acids 

(B) TYPE: amino acid 

(C) STRANBEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

25 val Glu Leu Asp Gly Asp Val Asn Gly Hi. Lys Phe Ser Val llr Gly 

25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

30 sf ^"^^ T ^•^^ Thr 

60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin llu 
35 Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
val Lys Phe Glu Gly Asp Thr Leu Jal Asn Arg He Glu llu Lys Gly 

40 130 """^ '"'^ I-e^ Glu Tyr 

135 240 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

Gly He Lys Val Asn Phe Lys He Arg His III He Glu Asp Gly Ser 

45 val Gin Leu Ala Asp His Tyr Gin Gin lln Thr Pro He Gly III Gly 

180 IOC c J 

J-oo 190 
Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 
50 2^0 "et Val Leu Leu Glu Phe 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp 11° Leu Tyr Lys Ser 



Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ser Ser III 

55 Leu Pro Phe Thr Pro Pro Val Val Lys ^g Leu Leu Gly Trp "s Lys 

265 270 



2112 



2160 
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Ser 


Ala 


Gly 


Gly 


Ser 


Gly Gly Ala 


Gly Gly 


Gly 


Glu 


Gin 


Asn 


Gly 


Gin 








275 










280 










285 










Glu 


Glu 


Lys 


Trp 


Cys 


Glu 


Lys 


Ala 


Val 


Lys 


Ser 


Leu 


Val 


Lys 


Lys 


Leu 






290 










295 










300 










5 


Lys 


Lys 


Thr 


Gly 


Arg 


Leu 


Asp 


Glu 


Leu 


Glu 


Lys 


Ala 


He 


Thr 


Thr 


Gin 




305 










310 










315 










320 




Asn 


Cys 


Asn 


Thr 


Lys 


Cys 


Val 


Thr 


He 


Pro 


Ser 


Thr 


Cys 


Ser 


Glu 


He 












325 










330 










335 






Trp 


Gly 


Leu 


Ser 


Thr 


Pro 


Asn 


Thr 


He 


Asp 


Gin 


Trp 


Asp 


Thr 


Thr 


Gly 


10 








340 










345 










350 








Leu 


Tyr 


Ser 


Phe 


Ser 


Glu 


Gin 


Thr 


Arg 


Ser 


Leu 


Asp 


Gly 


Arg 


Leu 


Gin 








355 










360 










365 










Val 


Ser 


His 


Arg 


Lys 


Gly 


Leu 


Pro 


His 


Val 


He 


Tyr 


Cys 


Arg 


Leu 


Trp 






370 










375 










380 










15 


Arg 


Trp 


Pro 


Asp 


Leu 


His 


Ser 


His 


His 


Glu 


Leu 


Lys 


Ala 


He 


Glu 


Asn 




385 










390 










395 










400 




Cys 


Glu 


Tyr 


Ala 


Phe 


Asn 


Leu 


Lys 


Lys 


Asp 


Glu 


Val 


Cys 


Val 


Asn 


Pro 












405 










410 










415 






Tyr 


His 


Tyr 


Gin 


Arg 


Val 


Glu 


Thr 


Pro 


Val 


Leu 


Pro 


Pro 


Val 


Leu 


Val 


20 








420 










425 










430 








Pro 


Arg 


His 


Thr 


Glu 


He 


Leu 


Thr 


Glu 


Leu 


Pro 


Pro 


Leu 


Asp 


Asp 


Tyr 








435 










440 










445 










Thr 


His 


Ser 


He 


Pro 


Glu 


Asn 


Thr 


Asn 


Phe 


Pro 


Ala 


Gly 


He 


Glu 


Pro 






450 










455 










460 










25 


Gin 


Ser 


Asn 


Tyr 


He 


Pro 


Glu 


Thr 


Pro 


Pro 


Pro 


Gly Tyr 


He 


Ser 


Glu 




465 










470 










475 










480 




Asp 


Gly 


Glu 


Thr 


Ser 


Asp 


Gin 


Gin 


Leu 


Asn 


Gin 


Ser 


Met 


Asp 


Thr 


Gly 












485 










490 










495 






Ser 


Pro 


Ala 


Glu 


Leu 


Ser 


Pro 


Thr 


Thr 


Leu 


Ser 


Pro 


Val 


Asn 


His 


Ser 


30 








500 










505 










510 








Leu 


Asp 


Leu 


Gin 


Pro 


Val 


Thr 


Tyr 


Ser 


Glu 


Pro 


Ala 


Phe 


Trp 


Cys 


Ser 








515 










520 










525 










He 


Ala 


Tyr 


Tyr 


Glu 


Leu 


Asn 


Gin 


Arg 


Val 


Gly 


Glu 


Thr 


Phe 


His 


Ala 






530 










535 










540 










35 


Ser 


Gin 


Pro 


Ser 


Leu 


Thr Val Asp Gly 


Phe 


Thr 


Asp 


Pro 


Ser 


Asn 


Ser 




545 










550 










555 










560 




Glu 


Arg 


Phe 


Cys 


Leu 


Gly 


Leu 


Leu 


Ser 


Asn 


Val 


Asn 


Arg 


Asn 


Ala 


Thr 












565 










570 










575 






Val 


Glu 


Met 


Thr 


Arg 


Arg 


His 


He 


Gly 


Arg 


Gly 


Val 


Arg 


Leu 


Tyr 


Tyr 


40 








580 










585 










590 








He 


Gly 


Gly 


Glu 


Val 


Phe 


Ala 


Glu 


Cys 


Leu 


Ser 


Asp 


Ser 


Ala 


He 


Phe 








595 










600 










605 










Val 


Gin 


Ser 


Pro 


Asn 


Cys 


Asn 


Gin 


Arg 


Tyr 


Gly 


Trp 


His 


Pro 


Ala 


Thr 






610 










615 










620 










45 


val 


Cys 


Lys 


He 


Pro 


Pro 


Gly 


Cys 


Asn 


Leu 


Lys 


He 


Phe 


Asn 


Asn 


Gin 




625 










630 










635 










640 




Glu 


Phe 


Ala 


Ala 


Leu 


Leu 


Ala 


Gin 


Ser 


Val 


Asn 


Gin Gly 


Phe 


Glu 


Ala 












645 










650 










655 






Val 


Tyr 


Gin 


Leu 


Thr 


Arg 


Met 


Cys 


Thr 


He 


Arg 


Met 


Ser 


Phe 


Val 


Lys 


50 








660 










665 










670 








Gly 


Trp 


Gly 


Ala 


Glu 


Tyr 


Arg 


Arg 


Gin 


Thr 


Val 


Thr 


Ser 


Thr 


Pro 


Cys 








675 










660 










685 










Trp 


He 


Glu 


Leu 


His 


Leu 


Asn 


Gly 


Pro 


Leu 


Gin 


Trp 


Leu 


Asp 


Lys 


Val 






690 










695 










700 










55 


Leu 


Thr 


Gin 


Met 


Gly 


Ser 


Pro 


Ser 


Val 


Arg 


Cys 


Ser 


Ser 


Met 


Ser 





705 710 715 
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(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2418 
* (D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0.52. 

si^ „=p s:; ?s s s - 



10 



15 



til c?u ir. z s ASP e:? t ^ 

^ap t.iy Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 



25 30 



s Zl s z ^. S S ^. S 

s s s.^ s s.^ s si - s i ;s s s 



55 60 



?hr Glv ™^ CAC ATG AAG 

^eu Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



70 7- 

'5 80 



90 



95 



S Jhr ?Ie S^f P^"" ^''^ ^^'^ -^^^ CGC GCC GAG 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Sa Su 



120 125 



^r,zr^.^. z s z tr. i-^ - - - - 



140 



J.*^ z z z s ^iT.T.^.zrr'^ 

xyr iie Met Ala Asp Lys Gin Lys Asn 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAG GAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

15 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
20 Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG AAT TCA ACC ATG GAC 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Asn Ser Thr Met Asp 
25 245 250 255 

AAT ATG TCT ATT ACG AAT ACA CCA ACA AGT AAT GAT GCC TGT CTG AGC 816 
Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys Leu Ser 
260 265 270 

30 

ATT GTG CAT AGT TTG ATG TGC CAT AGA CAA GGT GGA GAG AGT GAA ACA 864 
He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser Glu Thr 
275 280 285 

35 TTT GCA AAA AGA GCA ATT GAA AGT TTG GTA AAG AAG CTG AAG GAG AAA 912 
Phe Ala Lys Arg Ala He Glu Ser Leu Val Lys Lys Leu Lys Glu Lys 
290 295 300 

AAA GAT GAA TTG GAT TCT TTA ATA ACA GCT ATA ACT ACA AAT GGA GCT 960 
40 Lys Asp Glu Leu Asp Ser Leu He Thr Ala He Thr Thr Asn Gly Ala 
305 310 315 320 

CAT CCT AGT AAA TGT GTT ACC ATA CAG AGA ACA TTG GAT GGG AGG CTT 1008 
His Pro Ser Lys Cys Val Thr He Gin Arg Thr Leu Asp Gly Arg Leu 
45 325 330 335 

CAG GTG GCT GGT CGG AAA GGA TTT CCT CAT GTG ATC TAT GCC CGT CTC 1056 

Gin Val Ala Gly Arg Lys Gly Phe Pro His Val He Tyr Ala Arg Leu 
340 345 350 

50 

TGG AGG TGG CCT GAT CTT CAC AAA AAT GAA CTA AAA CAT GTT AAA TAT 1104 

Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val Lys Tyr 
355 360 365 

55 TGT CAG TAT GCG TTT GAC TTA AAA TGT GAT AGT GTC TGT GTG AAT CCA 1152 
Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val Asn Pro 
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370 



375 



380 



TAT CAC TAC GAA CGA GTT GTA TCA CCT nr-A tv^^ ^ 
Tyr His Tyr Glu Arg Val vll Ser Pro tII t r""" ^"^'^ 

5 3 85 

400 

10 " 4" 415 

SI z SI s .-I! s r ^ sr s 



45 



55 



?S S S= SI -I s ^. S 
Jv- S ?S S SI S S S in '^l S ?S 

r s m s s s s ii.= s s SI SI 

'° 4" 480 

s s K= SI I- - SI s SI s^ s 

30 490 495 

c=S S SI S SI SI SI .^I IS SI ?II IS SI S SI 

s SI s SI .-I ^11 ?ii -I SI IS S SI 

525 

CCA TAC ACA CCT AAT TTO rr-T nr^ ^ 

Pro Tyr Thr Pro ^ J^J CTT CAG CAC 

530 " Mis Leu Gin His 



"5 540 



^ IT. in SH S III if. Ill IS SI SI S SI in SI .-I 

560 

"I III SI HI SI SI SI "I r r 

c« ^^'^ Pro Ala Pro Glu Tyr 

50 ^" 570 

Ss ser SI jii oT r °™ 

580 "^"^ ''^^ Val Gly Glu Thr 



5S5 



590 



^^IIISISSIII--III?II----™ 



1200 
1248 
1296 
1344 
13 92 
1440 
1488 
1536 
1584 
1632 
1680 
1728 
1776 
1824 
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595 600 605 

GAC CCT TCT GGA GGA GAT CGC TTT TGT TTG GGT CAA CTC TCC AAT GTC 1872 
Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser Asn Val 
610 615 620 

CAC AGG ACA GAA GCC ATT GAG AGA GCA AGG TTG CAC ATA GGC AAA GGT 192 0 

His Arg Thr Glu Ala lie Glu Arg Ala Arg Leu His He Gly Lys Gly 
625 630 635 640 

GTG CAG TTG GAA TGT AAA GGT GAA GGT GAT GTT TGG GTC AGG TGC CTT 1968 
Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg Cys Leu 
645 650 655 

15 AGT GAC CAC GCG GTC TTT GTA CAG AGT TAC TAC TTA GAC AGA GAA GCT 2016 

Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg Glu Ala 
660 665 670 

GGG CGT GCA CCT GGA GAT GCT GTT CAT AAG ATC TAC CCA AGT GCA TAT 2064 
20 Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser Ala Tyr 
675 680 685 

ATA AAG GTC TTT GAT TTG CGT CAG TGT CAT CGA CAG ATG CAG CAG CAG 2112 
He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin Gin Gin 
25 690 695 700 

GCG GCT ACT GCA CAA GCT GCA GCA GCT GCC CAG GCA GCA GCC GTG GCA 2160 
Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala Val Ala 
705 710 715 720 

30 

GGA AAC ATC CCT GGC CCA GGA TCA GTA GGT GGA ATA GCT CCA GCT ATC 2208 
Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro Ala He 
725 730 735 

35 AGT CTG TCA GCT GCT GCT GGA ATT GGT GTT GAT GAC CTT CGT CGC TTA 2256 
Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg Arg Leu 
740 745 750 

TGC ATA CTC AGG ATG AGT TTT GTG AAA GGC TGG GGA CCG GAT TAC CCA 2304 
40 Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp Tyr Pro 
755 760 765 

AGA CAG AGC ATC AAA GAA ACA CCT TGC TGG ATT GAA ATT CAC TTA CAC 2352 
Arg Gin Ser He Lys Glu Thr Pro Cys Trp He Glu He His Leu His 
45 770 775 780 

CGG GCC CTC CAG CTC CTA GAC GAA GTA CTT CAT ACC ATG CCG ATT GCA 2400 
Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Met Pro He Ala 
785 790 795 800 



50 



55 



GAC CCA CAA CCT TTA GAC TGA 2421 
Asp Pro Gin Pro Leu Asp 
805 



(2) INFORMATION FOR SEQ ID NO: 53: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
<v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Val Ser Lys Giy Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

Glu Gly Glu Gly Asp Ala Thr Tyr G^y Lys Leu Thr Leu Lys Phe He 

Cys Thr Thr Gly Lys Leu Pro V^l Pro Trp Pro Thr ieu Val Thr Thr 

60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

Gin His ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

25 Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

105 

val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu llu Lys Gly 

30 III His iys Leu Glu Tyr 

Asn Tyr Asn Ser His Asn III Tyr He Met Ala Up Lys Gin Lys Asn 
Gly He Lys Val Asn Phe Lys He Arg His lln He Glu Asp Gly s"r 

35 val Gin Leu Ala Asp His Tyr Gin Gin lln Thr Pro He Gly Gly 

165 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

2 00 

40 ^" 2^0 """^ """" 

val Thr Ala Ala Gly lie IhI Leu Gly Met Asp llu Leu Tyr Lys Ser 
Gly Leu Arg ser Arg Ala Gin Ala Ser Asn S^r Asn Ser Thr Met Asp 

45 Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys "u Ser 

265 270 

lie val His ser Leu Met Cys His Arg Gin Gly Gly Glu Ser Glu Thr 

50 290 """^ ^« Glu Lys 

300 

Lys ASP Glu Leu Asp Ser Leu He Thr Ala He Thr Thr Asn Gly Ala 
His Pro ser Lys Cys VaJ Thr He Gin Arg T^r Leu Asp Gly Arg III 
55 Gin val Ala Gly Arg Lys Gly Phe Pro His" Val He oyr Ala III Leu 



PCT/DK98/00145 
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Trp 


Arg 


Trp 
355 


Pro 


Asp 


Leu 


His 


Lys 
360 


Asn 


Glu 


Leu 


Lys 


His 

365 


Val 


Lys 


Tyr 




Cys 


Gin 
370 


Tyr 


Ala 


Phe 


Asp 


Leu 
375 


Lys 


Cys 


Asp 


Ser 


Val 
380 


Cys 


Val 


Asn 


Pro 


5 


Tyr 


His 


Tyr 


Glu 


Arg 


Val 


Val 


Ser 


Pro Gly 


He 


Asp 


Leu 


Ser 


Gly 


Leu 




385 










390 










395 










400 




Thr 


Leu 


Gin 


Ser 


Asn 
405 


Ala 


Pro 


Ser 


Ser 


Met 

410 


Met 


Val 


Lys 


Asp 


Glu 
415 


Tyr 




Val 


His 


Asp 


Phe 


Glu Gly Gin 


Pro 


Ser 


Leu 


Ser 


Thr 


Glu 


Gly 


His 


Ser 


10 








420 










425 










430 








lie 


Gin 


Thr 
435 


lie 


Gin 


His 


Pro 


Pro 
440 


Ser 


Asn 


Arg 


Ala 


Ser 
445 


Thr 


Glu 


Thr 




Tyr 


Ser 
450 


Thr 


Pro 


Ala 


Leu 


Leu 
455 


Ala 


Pro 


Ser 


Glu 


Ser 
460 


Asn 


Ala 


Thr 


Ser 


15 


Thr 
465 


Ala 


Asn 


Phe 


Pro 


Asn 
470 


He 


Pro 


Val 


Ala 


Ser 
475 


Thr 


Ser 


Gin 


Pro 


Ala 
480 




Ser 


lie 


Leu 


Gly 


Gly Ser His 


Ser 


Glu 


Gly 


Leu 


Leu 


Gin 


He 


Ala 


Ser 












485 










490 










495 






Gly 


Pro 


Gin 


Pro 


Gly 


Gin 


Gin 


Gin 


Asn 


Gly Phe 


Thr 


Gly 


Gin 


Pro 


Ala 


20 








500 










505 










510 








Thr 


Tyr 


His 
515 


His 


Asn 


Ser 


Thr 


Thr 
520 


Thr 


Trp 


Thr 


Gly 


Ser 
525 


Arg 


Thr 


Ala 




Pro 


Tyr 


Thr 


Pro 


Asn 


Leu 


Pro 


His 


His 


Gin 


Asn 


Gly His 


Leu 


Gin 


His 






530 










535 










540 










25 


His 


Pro 


Pro 


Met 


Pro 


Pro 


His 


Pro 


Gly His 


Tyr 


Trp 


Pro 


Val 


His 


Asn 




545 










550 










555 










560 




Glu 


Leu 


Ala 


Phe 


Gin 
565 


Pro 


Pro 


He 


Ser 


Asn 
570 


His 


Pro 


Ala 


Pro 


Glu 
575 


Tyr 




Trp 


Cys 


Ser 


lie 


Ala 


Tyr 


Phe 


Glu 


Met 


Asp 


Val 


Gin 


Val 


Gly 


Glu 


Thr 


30 








580 










585 










590 








Phe 


Lys 


Val 
595 


Pro 


Ser 


Ser 


Cys 


Pro 
600 


He 


Val 


Thr 


Val 


Asp 
605 


Gly 


Tyr 


Val 




Asp 


Pro 


Ser 


Gly 


Gly Asp Arg 


Phe 


Cys 


Leu Gly Gin Leu 


Ser 


Asn 


Val 






610 










615 










620 










35 


His 
625 


Arg 


Thr 


Glu 


Ala 


He 
630 


Glu 


Arg 


Ala 


Arg 


Leu 
635 


His 


He 


Gly 


Lys 


Gly 
64 0 




Val 


Gin 


Leu 


Glu 


Cys 
645 


Lys 


Gly 


Glu 


Gly 


Asp 
650 


Val 


Trp 


Val 


Arg 


Cys 
655 


Leu 




Ser 


Asp 


His 


Ala 


Val 


Phe 


Val 


Gin 


Ser 


Tyr 


Tyr 


Leu 


Asp 


Arg 


Glu 


Ala 


40 








660 










665 










670 








Gly 


Arg 


Ala 


Pro 


Gly Asp 


Ala 


Val 


His 


Lys 


He 


Tyr 


Pro 


Ser 


Ala 


Tyr 








675 










680 










685 










lie 


Lys 
690 


Val 


Phe 


Asp 


Leu 


Arg 
695 


Gin 


Cys 


His 


Arg 


Gin 
700 


Met 


Gin 


Gin 


Gin 


45 


Ala 
705 


Ala 


Thr 


Ala 


Gin 


Ala 
710 


Ala 


Ala 


Ala 


Ala 


Gin 
715 


Ala 


Ala 


Ala 


Val 


Ala 
720 




Gly 


Asn 


lie 


Pro 


Gly 


Pro 


Gly 


Ser 


val 


Gly Gly 


He 


Ala 


Pro 


Ala 


He 












725 










730 










735 






Ser 


Leu 


Ser 


Ala 


Ala 


Ala 


Gly 


He 


Gly 


Val 


Asp 


Asp 


Leu 


Arg 


Arg 


Leu 


50 








740 










745 










750 








Cys 


lie 


Leu 
755 


Arg 


Met 


Ser 


Phe 


Val 
760 


Lys 


Gly 


Trp 


Gly 


Pro 
765 


Asp 


Tyr 


Pro 




Arg 


Gin 
770 


Ser 


He 


Lys 


Glu 


Thr 
775 


Pro 


Cys 


Trp 


He 


Glu 
780 


He 


His 


Leu 


His 


55 


Arg 
785 


Ala 


Leu 


Gin 


Leu 


Leu 
790 


Asp 


Glu 


Val 


Leu 


His 
795 


Thr 


Met 


Pro 


He 


Ala 
800 
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Asp Pro Gin Pro Leu Asp 
805 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...3117 
(D) OTHER INFORMATION: 

20 '^^^ SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTO 
Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val V.l Pro Xle 



10 



15 



25 



35 



45 



PCT/DK98/00145 



10 



15 



;s -I s z s ^. ^ s s ^5 

25 30 



45 



ilr rll Glv r r ACC 

cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 



5= 60 



Su Jhr Sr Gly "It Oil r."" '^^^ '^^^ 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

40 '° 75 80 

T"" '^'^ T ^'^^ •^^'^ ^^<^ «^ GGC TAC GTC CAG GAG 

Gin Has ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr vll X X 



90 



95 



2g ?hr lit SJe T f " ™= °CC GAG 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 



HO 



xit ASP ir. G?u t r T 

55 130 ^ ""^^ His Lys Leu Glu Tyr 



48 



96 



30 »« 

40 



192 



240 



288 



336 



50 ^a^ lit s z r f t 

115 Lys Gly 

120 125 



432 



135 
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AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

5 GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
10 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
15 195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

20 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

25 GGA CTC AGA TCT ACC ATG GCG GGC TGG ATC CAG GCC CAG CAG CTG CAG 768 

Gly Leu Arg Ser Thr Met Ala Gly Trp He Gin Ala Gin Gin Leu Gin 
24S 250 255 

GGA GAC GCG CTG CGC CAG ATG CAG GTG CTG TAC GGC CAG CAC TTC CCC 816 
30 Gly Asp Ala Leu Arg Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro 
260 265 270 

ATC GAG GTC CGG CAC TAC TTG GCC CAG TGG ATT GAG AGC CAG CCA TGG 864 
He Glu Val Arg His Tyr Leu Ala Gin Trp He Glu Ser Gin Pro Trp 
35 275 280 285 

GAT GCC ATT GAC TTG GAC AAT CCC CAG GAC AGA GCC CAA GCC ACC CAG 912 

Asp Ala He Asp Leu Asp Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin 

290 295 300 

40 

CTC CTG GAG GGC CTG GTG CAG GAG CTG CAG AAG AAG GCG GAG CAC CAG 960 

Leu Leu Glu Gly Leu Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin 

305 310 315 320 

45 GTG GGG GAA GAT GGG TTT TTA CTG AAG ATC AAG CTG GGG CAC TAC GCC 1008 
Val Gly Glu Asp Gly Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala 
325 330 335 

ACG CAG CTC CAG AAA ACA TAT GAC CGC TGC CCC CTG GAG CTG GTC CGC 1056 
50 Thr Gin Leu Gin Lys Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg 
340 345 350 

TGC ATC CGG CAC ATT CTG TAC AAT GAA CAG AGG CTG GTC CGA GAA GCC 1104 
Cys He Arg His He Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala 
55 355 360 365 
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AAC AAT TGC AGC TCT CCG OCT GGG ATC CTG GTT GAC GCC ATG TCC CAG 
Asn Asn cys Ser Ser Pro Ala Gly xie Leu Val Asp Ala Met ITr Sn 

•'"S 380 

AAG CAC CTT CAG ATC AAC CAG ACA TTT GAG GAG CTG CGA CTG GTC ACG 
jys Hxs Leu Gin He Asn Gin Thr Phe Glu Glu Leu Arg Su 7.1 

CAG GAC ACA GAG AAT GAG CTG AAG AAA CTG CAG CAG ACT CAG GAG TAC 
Gin ASP Thr Glu Asn Glu Leu Lys Lys Leu Gin Gin Thr G^^ g^u ^r 

410 415 

TTC ATC ATC CAG TAC CAG GAG AGC CTG AGG ATC CAA GCT CAG TTT GCC 
Phe lie He Gin xyr Gin Glu Ser Leu Arg He Gin A^I G^n A^a 
420 



430 



ail llu 1'^'' """^ '''''' CTG AGC CGG GAG ACG GCC 

Gin Leu Ala Gin Leu Ser Pro Gin Glu Arg Leu Ser Arg Glu tS 111 
«->3 440 



445 



CTC CAG CAG AAG CAG GTG TCT CTG GAG GCC TGG TTG CAG CGT GAG GCA 
I.eu Gin Gin Lys Gin Val Ser Leu Glu Ala Trp Leu Sn Sg G^u Ala 

Gin r° '^''^ "^^^ ^ AAG CAC CAG AAG 

Gin Thr Leu Gin Gin Tyr Arg Val Glu Leu Ala Glu Lys His oTn 

475 480 

?Sr Sn III r° ATC CTG GAT GAC GAG 

Thr Leu Gin Leu Leu Arg Lys Gin Gin Thr He He Leu Asp Asp Glu 
''"S 490 



495 



He X ?S r '^'^^ "° ^C GGC GGG CCC 

Leu He Gin Trp Lys Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro 

500 505 510 

Pro Glu G?v ill r° T"" °™ ^'^^ ^= '^^G AAG TTO GCC 

Pro Glu Gly Ser Leu Asp val Leu Gin Ser Trp Cys Glu Lys Leu Ala 

520 

• 111 ill He Sn T"" f ° ^"'^ CAC CTC 

Glu He He Trp Gin Asn Arg Gin Gin He Arg Arg Ala Glu His Leu 

540 

J^s ^ r° '''''' '''''' ''^C CCA GTG GAG GAG ATG CTG GCC GAG 

Gin Gin Leu Pro He Pro Gly Pro Val Glu Glu Met Leu III Su 

"° 555 560 

vlt ^l Sa ill ill ?f f " °=C CTG GTG ACC AGC ACA 

""^^ ^^"^ Asp He He Ser Ala Leu Val Thr Ser Thr 

5« 570 575 

SJe ill i7 ^ CAG GTC CTG AAG ACC CAG ACC AAG 

Phe He He Glu Lys Gin Pro Pro Gin Val Leu" Lys Thr Gin Thr 

585 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 
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TTT GCA GCC ACC GTA CGC CTG CTG GTG GGC GGG AAG CTG AAC GTG CAC 1824 
Phe Ala Ala Thr Val Arg Leu Leu Val Gly Gly Lys Leu Asn Val His 
595 600 605 



5 ATG AAT CCC CCC CAG GTG AAG GCC ACC ATC ATC AGT GAG CAG CAG GCC 1872 
Met Asn Pro Pro Gin Val Lys Ala Thr lie lie Ser Glu Gin Gin Ala 
610 615 620 



AAG TCT CTG CTT AAA AAT GAG AAC ACC CGC AAC GAG TGC AGT GGT GAG 1920 
10 Lys Ser Leu Leu Lys Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu 
625 630 635 640 



ATC CTG AAC AAC TGC TGC GTG ATG GAG TAC CAC CAA GCC ACG GGC ACC 1968 
lie Leu Asn Asn Cys Cys Val Met Glu Tyr His Gin Ala Thr Gly Thr 
15 645 650 655 



CTC AGT GCC CAC TTC AGG AAC ATG TCA CTG AAG AGG ATC AAG CGT GCT 2016 

Leu Ser Ala His Phe Arg Asn Met Ser Leu Lys Arg lie Lys Arg Ala 
660 665 670 

20 

GAC CGG CGG GGT GCA GAG TCC GTG ACA GAG GAG AAG TTC ACA GTC CTG 2064 

Asp Arg Arg Gly Ala Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu 
675 680 685 

25 TTT GAG TCT CAG TTC AGT GTT GGC AGC AAT GAG CTT GTG TTC CAG GTG 2112 

Phe Glu Ser Gin Phe Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val 

690 695 700 



AAG ACT CTG TCC CTA CCT GTG GTT GTC ATC GTC CAC GGC AGC CAG GAC 2160 
30 Lys Thr Leu Ser Leu Pro Val Val Val lie Val His Gly Ser Gin Asp 
705 710 715 720 



CAC AAT GCC ACG GCT ACT GTG CTG TGG GAC AAT GCC TTT GCT GAG CCG 2208 
His Asn Ala Thr Ala Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro 
35 725 730 735 



GGC AGG GTG CCA TTT GCC 
Gly Arg Val Pro Phe Ala 
740 

40 

TGT GAG GCG CTC AAC ATG 
Cys Glu Ala Leu Asn Met 
755 

45 GGC CTG ACC AAG GAG AAC 
Gly Leu Thr Lys Glu Asn 
770 



GTG CCT GAC AAA GTG CTG 
Val Pro Asp Lys Val Leu 
745 

AAA TTC AAG GCC GAA GTG 
Lys Phe Lys Ala Glu Val 
760 

CTC GTG TTC CTG GCG CAG 
Leu Val Phe Leu Ala Gin 
775 780 



TGG CCG CAG CTG 2256 
Trp Pro Gin Leu 
750 

CAG AGC AAC CGG 2304 

Gin Ser Asn Arg 

765 

AAA CTG TTC AAC 2352 
Lys Leu Phe Asn 



AAC AGC AGC AGC CAC CTG GAG GAC TAC AGT GGC CTG TCC GTG TCC TGG 2400 
50 Asn Ser Ser Ser His Leu Glu Asp Tyr Ser Gly Leu Ser Val Ser Trp 
785 790 795 800 



TCC CAG TTC AAC AGG GAG AAC TTG CCG GGC TGG AAC TAC ACC TTC TGG 244 8 

Ser Gin Phe Asn Arg Glu Asn Leu Pro Gly Trp Asn Tyr Thr Phe Trp 
55 805 810 815 
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820 825 830 

CAC TGG AAT GAT GGG GCC ATC CTA GGT TTT GTG AAT AAG CAA CAG GCC 
H.S Trp Asn Asp Gly Ala He I.eu Gly Phe Val Asn Kys G^^ Ala 

840 845 

CAC GAC CTG CTC ATC AAC AAG CCC GAG GGG ACC TTC TTG TTG CGC TTT 
Hxs A.P I,eu Leu He Asn Lys Pro Asp Gly Thr Phe Leu Z 

855 ggo 

AGT GAC TCA GAA ATC GGG GGC ATC ACC ATC GCC TGG AAG TTT GAC TCC 
Ser ASP ser Glu He Gly Gly He Thr He Ala Trp JJs Z Sp sir 
"° 875 880 

Pro G^^ So f TGG AAC era AAA CCA TTC ACC ACG CGG GAT TTC 

Pro Glu Arg Asn Leu Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe 
885 890 895 

TCC ATC AGG TCC CTG GCT GAC CGG CTG GGG GAC CTG AGC TAT CTC ATC 
ser He Arg Ser Leu Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu He 

905 

Tyr Val Phe Pro Asp Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr 

920 

ACT CCT GTG CTG GCT AAA GCT GTT GAT GGA TAT GTG AAA CCA CAG ATC 
Thr Pro val Leu Ala Lys Ala Val Asp Gly ^r VaJ Z Pro Sn He 

935 

f ? ^""^ nf °™ ^^'^ "^^ GCT GGG GGC 

Lys Gin val Val Pro Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly 

955 960 

AGC AGC GCC ACG TAC ATG GAC CAG GCC CCC TCC CCA GCT GTG TGC CCC 
ser ser Ala Thr Tyr Met Asp Gin Ala Pro Ser Pro SI Si C^s Pro 
965 970 975 

X s ^ ^ - - X r zi s s z - 

985 



2544 



2592 



2640 



2688 



2736 



2784 



2832 



2880 



2928 



990 



Sn ASP Tit 21 r r° *™ 

^ til "^^^ Gl" Thr Met Asp Val Ala Arg His 

1000 



1005 



vll G?u G^J Su f ^ ''"^ ^^-^ TCC CGC CTC 

\olo ''''^ A=P ser Arg Leu 

1015 1020 

£ S S S T T. 21 ?S S S S 

■"•"^^ 1035 1 



3024 



3072 



3120 
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1039 amino acids 

(B) TYPE: amino acid 

(C) STRi\NDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



15 


Met 
X 


Val 


Ser 


Lys 


Gly 
5 


Glu 


Glu 


Leu 


Phe 


Thr 
10 


Gly 


Val 


Val 


Pro 


He 
15 


Leu 




val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 










20 










25 










30 








Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


20 






35 










40 










45 










Cys 


Thr 
50 


Thr 


Gly 


Lys 


Leu 


Pro 
55 


Val 


Pro 


Trp 


Pro 


Thr 
60 


Leu 


Val 


Thr 


Thr 




Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 




65 










70 










75 










80 


25 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly Tyr 


Val 


Gin 


Glu 












85 










90 










95 






Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp Gly 


Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 










100 










105 










110 








Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 


30 






115 










120 










125 










lie 


Asp 
130 


Phe 


Lys 


Glu 


Asp 


Gly 
135 


Asn 


He 


Leu 


Gly 


His 
140 


Lys 


Leu 


Glu 


Tyr 




Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 




145 










150 










155 










160 


35 


Gly 


He 


Lys 


Val 


Asn 
165 


Phe 


Lys 


He 


Arg 


His 
170 


Asn 


He 


Glu 


Asp 


Gly 

175 


Ser 




Val 


Gin 


Leu 


Ala 
180 


Asp 


His 


Tyr 


Gin 


Gin 
185 


Asn 


Thr 


Pro 


He 


Gly 
190 


Asp 


Gly 




Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


iSer 


Thr 


Gin 


Ser 


Ala 


Leu 


40 






195 










200 










205 










Ser 


Lys 
210 


Asp 


Pro 


Asn 


Glu 


Lys 
215 


Arg 


Asp 


His 


Met 


Val 
220 


Leu 


Leu 


Glu 


Phe 




Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 




225 










230 










235 










240 


45 


Gly 


Leu 


Arg 


Ser 


Thr 


Met 


Ala 


Gly Trp 


He 


Gin 


Ala 


Gin 


Gin 


Leu 


Gin 












245 










250 










255 






Gly 


Asp 


Ala 


Leu 


Arg 


Gin 


Met 


Gin 


Val 


Leu 


Tyr 


Gly Gin 


His 


Phe 


Pro 










260 










265 










270 








lie 


Glu 


Val 


Arg 


His 


Tyr 


Leu 


Ala 


Gin 


Trp 


He 


Glu 


Ser 


Gin 


Pro 


Trp 


50 






275 










280 










285 










Asp 


Ala 
290 


He 


Asp 


Leu 


Asp 


Asn 
295 


Pro 


Gin 


Asp 


Arg 


Ala 
300 


Gin 


Ala 


Thr 


Gin 




Leu 


Leu 


Glu 


Gly 


Leu 


Val 


Gin 


Glu 


Leu 


Gin 


Lys 


Lys 


Ala 


Glu 


His 


Gin 




305 










310 










315 










320 


55 


Val 


Gly 


Glu 


Asp 


Gly 


Phe 


Leu 


Leu 


Lys 


He 


Lys 


Leu Gly His 


Tyr 


Ala 



325 330 335 
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Thr cm Leu Gin Lys Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg 
Cys xae Arg H.s Xle Leu Tyr Asn oJu OI„ Arg Leu Val Arg OIu Ala 

5 Asn Asn Cys Ser Ser Pro Ala flv, t , 

370 Asp Ala Met Ser Gin 

Lys His Leu Gin He Asn Gin Thr Phe au, ri„ » 

3 as Leu Arg Leu Val Thr 

Om ASP Xhr Olu Asn Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu ^Jr 
Phe Xle Xle Gin ^r Gin Glu Ser Leu Z xle Gin Ala Gin ill Ala 
cm Leu Ala Gin Leu Ser Pro Gin g" Arg Leu Ser Arg JL" Thr Ala 
15 Leu Oln cm Lys Gin Val Ser Leu Glu Ala Trp Leu Gl" Arg Glu Ala 
Oln Thr Leu Gin Gin Tyr Arg Val Glu Leu Ala Glu" .ys His Gin Lys 
Thr Leu Gin Leu Leu Arg Lys Gin Gin Thr 111 xie Leu Asp Asp Z 
X-eu Xle Gin Trp Lys Arg Arg oin Gin IZ Ala Gly Asn Gly Gly Pro 
pro Glu Gly ser Leu Asp VaX Leu HI ser Trp Cys Glu l^l Leu Ala 



25 Glu Xle Xle Trp Gin Asn Arg III Gin Xle Arg Arg III Glu His Leu 

^-^^ 540 

Glu 



cys Gin Gin Leu Pro Xle Pro Gly Pro Val Clu 5lu Met Leu Ala 



Val Asn Ala Thr xle Thr Aan ti- n ^ f 
30 ^tZ Ser Ala Leu Val Thr Ser Thr 

Phe Xle Xle Glu Lys Gin Pro Pro Gin Val Leu Lys Thr Gin Ihr Lys 

Phe Ala Ala Thr Val Arg Leu Leu v" Gly Gly Lys Leu IZ Val His 

oc -'^-j 600 

Jo Met Asn PT-ft D>-« m« xr^-. ^05 



Met Asn Pro Pro Gin Val Lys Ala Thr Xle Xle Ser oil Gin Gin Ala 
.ys ser Leu Leu Lys Asn Glu Asn Thr Arg Asn all Cys Ser Gly Glu 
^ Xle Leu Asn Asn Cys Cys Val Met Glu xyr g1„ Ala Thr Gly III 

ser Ala His Phe Arg Asn Met Ser III .,s Arg xle Lys III Ala 
ASP Arg Arg Gly Ala Glu Ser Val Thr Glu Glu Lys Phe III Val Leu 
45 Phe Glu ser Gin Phe Ser Val Z ser Asn Glu Leu III Phe Gin Val 

700 



Lys Thr Leu Ser Leu Pro Val Val Val ti» „ -. 
70S lie Val His Gly Ser Gin Asp 

His Asn Ala Thr Ala Thr v»1 t ^ . '"""^ ''^O 
50 ^^"^ Trp Asp Asn Ala Phe Ala Glu Pro 

Oly Arg val Pro Phe Ala Val Pro Asp val Leu Trp Pro Gin Leu 

ex. Glu Ala Leu Asn Met Lys P.e l^l Ala Glu Val Gin III Asn Arg 
55 Gly .eu Thr Lys Glu Asn Leu Val Phe Leu Ala Gin l^l .eu Phe Asn 



■^■^5 780 
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10 



15 



20 



25 



30 



35 



40 



45 



50 









Ser 


His 


Leu 


Glu 


Asp 


Tyr 


Ser 


Gly 


Leu 


Ser 


Ms* 1 
VaX 


Ser 


Trp 


785 










790 










795 










800 


OCX 


CZl n 


XT lie 


Asn 


Arg 


Glu 


Asn 


Leu 


Pro 


Gly Trp 


Asn 


Tyr 


Tnr 


IT US 


Trp 










805 










810 










815 




Gin 


ixp 


flic 


Asp 


Gly 


Val 


Met 




V cl X 


Leu 


Lys 


Lys 




nxo 


Lys 


Pro 








82 0 










82 5 










830 






13 i IS 


Trp 


Asn 


Asp 


Gly 


Ala 


He 


Leu 


oxy 


Phe 


Val 


Asn 


Lys 


oxn 


vaxn 


AX a 






R c: 
O O 3 




















Oil Q 
Oft D 








XIJ. S 


Asp 


Leu 


Leu 


He 


Asn 


Lys 


Pro 


Asp 


Gly Thr 


Phe 


Leu 


Leu 


Arg 


Phe 




Q c n 










855 










Q C A 












Asp 


oer 


uXU 


He Gly Gly 


X j.e 


Thr 


He 


Ala 


TlTp 


Lys 


Pne 


Asp 


Ser 












870 










875 










O O A 


PjTO 


1:1 ±11 


Arg 


Asn 


Leu 


Trp 


Asn 


Leu 


Lys 


Pro 


Phe 


Thr 


Thr 


Arg 


Asp 


13 A 

Fne 










885 










890 










Q QC 




S63r 


Tin 


Arg 


Ser 


Leu 


Ala 


Asp 


Arg 


Leu 


Gly Asp 


Leu 


Ser 


Tyr 


Leu 


ixe 


















905 
















Tyr 


va J. 


Pne 


Pro 


Asp 


Arg 


Pro 


Lys 


Asp 


Glu 


Val 


Phe 


Ser 


Lys 


Tyr 


Tyr 
















920 










925 








Thr 


Pro 


vSi± 


Leu 


Ala 


Lys 


Ala 


Val 


Asp Gly 


Tyr 


Val 


Lys 


Pro 


uxn 


T 1 & 

xxe 




930 










935 










940 










Lys 


Gin 


Val 


Val 


Pro 


Glu 


Phe 


Val 


Asn 


Ala 


Ser 


Ala 


Asp 


Ala 


Gly 


Gly 


945 










950 










955 










960 


Ser 


Ser 


Ala 


Thr 


Tyr 


Met 


Asp 


Gin 


Ala 


Pro 


Ser 


Pro 


Ala 


Val 


Cys 


Pro 










965 










970 










975 




Gin 


Ala 


Pro 


Tyr 


Asn 


Met 


Tyr 


Pro 


Gin 


Asn 


Pro 


Asp 


His 


Val 


Leu 


Asp 








980 










985 










990 






Gin 


Asp 


Gly 


Glu 


Phe 


Asp 


Leu 


Asp 


Glu 


Thr 


Met 


Asp 


Val 


Ala 


Arg 


His 






995 








1000 








1005 








Val 


Glu 


Glu 


Leu 


Leu 


Arg 


Arg 


Pro 


Met 


Asp 


Ser 


Leu 


Asp 


Ser 


Arg 


Leu 


1010 








1015 








1020 










Ser 


Pro 


Pro 


Ala 


Gly 


Leu 


Phe 


Thr 


Ser 


Ala 


Arg 


Gly 


Ser 


Leu 


Ser 





025 1030 1035 1 



(2) INFORMATION FOR SEQ ID N0:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY; Coding Sequence 

(B) LOCATION: 1...1872 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATG GCG GCG GCG GCG GCG GCT CCG GGG GGC GGG GGC GGG GAG CCC AGG 4 8 

Met Ala Ala Ala Ala Ala Ala Pro Gly Gly Gly Gly Gly Glu Pro Arg 
15 10 15 

GGA ACT GCT GGG GTC GTC CCG GTG GTC CCC GGG GAG GTG GAG GTG GTG 96 
Gly Thr Ala Gly Val Val Pro Val Val Pro Gly Glu Val Glu Val Val 
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20 



25 



30 



Z S 21 Z 51! S SI ?S S 



«° 45 



iT. Z S S S IS S S= S! 



55 60 



S Six r r "'^'^ 

^^g y rnr Arg Val Ala He Lys Lys He Ser Pro Phe Glu His Gin 



75 



80 



90 



95 



s ^ ?s s ji: s si tii si s si ?s 



110 



m I" SI ill SI in SI s SI r r 

Si* ""^J- lyr ij.e Val Gin Asp Leu Met Glu Thr 



"0 



SI S 21 SI J- »s s= 



"5 



cys 21 Su G^: ?r ^ 
ryr Phe Leu Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser 

"° 155 160 

GCC AAT GTG CTG CAC CGG GAC CTG AAG CCT TCC AAT CTG CTT ATP AAr 
Ala Asn val Leu His Arg Asp Leu Lys Pro Ser He 

"0 175 

?hr f ^^'^ ^TG GCC CGG ATT GCT 

Thr Thr cys Asp Leu Lys He Cys Asp Phe Gly Leu Ala Arg S a" 



185 



190 



ASP So G?u Sp ?s ir ^ '''' ^^'^ 

195 ^ "^^^ Tyr Val Ala Thr 

200 



205 



Arg S 111 Pro Gf 

210 ^ "^"^ ser Lys Gly Tyr Thr 



220 



ser He ^ He S I"" '^^^ CTC 

225 ^ ''^^ Ala Glu Met Leu 



235 



240 



ser Arg So Jle pL' T 

Asn Arg Pro He Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn 



144 

192 

240 

288 

336 

384 

432 

480 

528 

576 

624 

672 

720 

768 

118 
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245 250 255 

CAC ATT CTA GGT ATC TTG GGT TCC CCA TCC CAG GAG GAC CTT AAT TGC 816 
His lie Leu Gly lie Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys 
260 265 270 

ATC ATT AAC ATG AAG GCC CGA AAC TAG CTG CAG TCT CTG CCC TCG AAA 864 
lie lie Asn Met Lys Ala Arg Asn Tyr Leu Gin Ser Leu Pro Ser Lys 
275 280 285 

ACC AAG GTG GCT TGG GCC AAG CTC TTT CCT AAA TCT GAC TCC AAA GCT 912 
Thr Lys Val Ala Trp Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala 
290 295 300 

15 CTT GAC CTG CTG GAC CGG ATG TTA ACC TTC AAC CCA AAC AAG CGC ATC 960 
Leu Asp Leu Leu Asp Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie 
305 310 315 320 

ACA GTA GAG GAA GCG CTG GCT CAC CCT TAG CTG GAA CAG TAC TAC GAT 1008 
20 Thr Val Glu Glu Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp 

325 330 335 

CCG ACA GAT GAG CCA GTG GCC GAG GAG CCA TTC ACC TTC GAC ATG GAG 1056 
Pro Thr Asp Glu Pro Val Ala Glu Glu Pro Phe Thr Phe Asp Met Glu 
25 340 345 350 

CTG GAT GAC CTC CCC AAG GAG CGG CTG AAG GAG TTG ATC TTC CAG GAG 1104 

Leu Asp Asp Leu Pro Lys Glu Arg Leu Lys Glu Leu lie Phe Gin Glu 
355 360 365 

30 

ACA GCC CGC TTC CAG CCA GGG GCG CCA GAG GGC CCC GGG CGC GCC ATG 1152 

Thr Ala Arg Phe Gin Pro Gly Ala Pro Glu Gly Pro Gly Arg Ala Met 

370 375 380 

35 AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA 1200 
Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu 
385 390 395 400 

TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT GGA GAG GGT 124 8 

40 Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Gly Glu Gly 

405 410 415 

GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC ACT 1296 
Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr 
45 420 425 430 

ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC ACT ACT CTC ACT 1344 
Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
435 440 445 

50 

TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT ATG AAA CAG CAT 13 92 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
450 455 460 

55 GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT 144 0 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
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475 



SI ^ ^ Z Z Z ?s s s .^^^ c 

495 

TTT GAA GGT GAT ACC CTT htt rtvt. 

..U O., ™^ S r - - - - - .TX 



510 



TTT AAA GAA GAT GGA AAC ATT CTT nnn n„„ 

... ... ..p S S S »^ ^I JJ.' 



520 



s z 5^ J- s s r r "° 

530 f„ P'^o l-ys Asn Gly iXe 



545 ^ Asp Gly ser Val Gin 



555 



560 



^ s z z 1^ s;: z s 5s s s s 



620 



AA 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 624 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
Met Ala Ala Ala Ala Ala Ala Pro Gly Gly Gly Gly Gly Glu Pro Arg 
Oly Thr Ala Gly Val Val Pro Val Val ^ro Gly Glu Val Glu Hi Val 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



21 s ^ - sr s s x s r.: s s ^ 

590 

Z Z S 2- - CTT ™ ™c r:, CT» 

^ ^ "^'^ I-eu Glu Phe Val Thr 

605 

Al'I Sa G^y ^JI ^ S Jit T T ^ ^ 

y Tiir His Gly Met Asp Glu Leu Tyr Lys Pro Gin Glu 



1875 
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20 

Lys Gly Gin Pro Phe 
35 

lie Gly Glu Gly Ala 
5 50 

Arg Lys Thr Arg Val 
65 

Thr Tyr Cys Gin Arg 
85 

10 Arg His Glu Asn Val 
100 

Leu Glu Ala Met Arg 
115 

Asp Leu Tyr Lys Leu 
15 130 

Cys Tyr Phe Leu Tyr 
145 

Ala Asn Val Leu His 
165 

20 Thr Thr Cys Asp Leu 
180 

Asp Pro Glu His Asp 
195 

Arg Trp Tyr Arg Ala 
25 210 

Lys Ser lie Asp lie 
225 

Ser Asn Arg Pro lie 
245 

30 His He Leu Gly He 
260 

He lie Asn Met Lys 

275 

Thr Lys Val Ala Trp 
35 290 

Leu Asp Leu Leu Asp 
305 

Thr Val Glu Glu Ala 
325 

40 Pro Thr Asp Glu Pro 
340 

Leu Asp Asp Leu Pro 
355 

Thr Ala Arg Phe Gin 
45 370 

Ser Lys Gly Glu Glu 
385 

Leu Asp Gly Asp Val 
405 

50 Glu Gly Asp Ala Thr 
420 

Thr Gly Lys Leu Pro 
435 

Tyr Gly Val Gin Cys 
55 450 

Asp Phe Phe Lys Ser 



121 

25 



Asp 


Val 


Gly 




Arg 


xyi: 






40 








Tvr 


Glv 


Met 


Val 


Ser 


Ser 




55 










Ala 


He 


Lys 


Lys 


He 


Ser 


70 










75 


Thr 


Leu 


Aro 


Glu 


He 


Gin 














He 


Gly 




Arg 


Asp 


Tl f> 














sp 


vctX 


Tyr 


J. X6 


Vox 


f3l n 
VJX 11 






Ton 








eu 


Lys 


Ser 


r!l n 
Vj J-il 


uXil 


Leu 














Gin 


He 


Leu 


Arg 


Gly 


Leu 












155 


Arg 


Asp 


Leu 


Lys 


Pro 


Ser 










170 




Lys 


He 


Cys 


Asp 


Fne 


ts±y 








185 






ti.± s 


Thr 


Gly 


Phe 


Leu 


Thr 






200 








Pro 


Glu 


He 


Met 


Leu 


Asn 




215 










Trp 


Ser 


Val 


Gly 


Cys 


T T o 

X j.e 


^ J u 














Pro Gly Lys 


X1J.S 


Tyr 










OCA 




Leu 


Gly Ser 


Pro 


Ser 


Gin 








265 






Ala 


Arg 


Asn 


Tyr 


Leu 








280 










Lys 


Leu 


Phe 


Pro 


Lys 




295 










Arg 


Met 


Leu 


Thr 


irne 


Asn 


1 "t n 










lie 


Leu 


Ala 


His 


Pro 


Tyr 


Leu 










330 




va J. 


Ala 


Glu 


Glu 


Pro 


Pne 








345 






Lys 


Glu 


Arg 


Leu 


Lys 


Glu 






360 








Pro 


Gly Ala 


Pro 


Glu 


Gly 




375 










Leu 


Phe 


Thr 


Gly 


Val 


Val 


390 










395 


Asn 


Gly Gin 


Lys 


Phe 


Ser 










410 




Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 








425 






Val 


Pro 


Trp 


Pro 


Thr 


Leu 






440 








Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 




455 










Ala 


Met 


Pro 


Glu 


Gly 


Tyr 
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30 

Thr Gin Leu Gin Tyr 
45 

Ala Tyr Asp His Val 
60 

Pro Phe Glu His Gin 
80 

He Leu Leu Arg Phe 
95 

Leu Arg Ala Pro Thr 
110 

Asp Leu Met Glu Thr 

125 

Ser Asn Asp His He 
140 

Lys Tyr He His Ser 
160 

Asn Leu Leu He Asn 
175 

Leu Ala Arg He Ala 
190 

Glu Tyr Val Ala Thr 
205 

Ser Lys Gly Tyr Thr 
220 

Leu Ala Glu Met Leu 
240 

Leu Asp Gin Leu Asn 
255 

Glu Asp Leu Asn Cys 

270 

Ser Leu Pro Ser Lys 
285 

Ser Asp Ser Lys Ala 
300 

Pro Asn Lys Arg He 
320 

Glu Gin Tyr Tyr Asp 
335 

Thr Phe Asp Met Glu 
350 

Leu He Phe Gin Glu 
365 

Pro Gly Arg Ala Met 
380 

Pro He Leu Val Glu 
400 

Val Ser Gly Glu Gly 
415 

Lys Phe He Cys Thr 
430 

Val Thr Thr Leu Thr 
445 

His Met Lys Gin His 
460 

Val Gin Glu Arg Thr 
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465 



470 



Xle Phe ryr .y. .sp .sp Oly .sn Xyr .ys HI .Xa OXu Val lyl 

Phe Olu Oly J3P Thr .eu Val .sn .^u .ys Ol. .^p 

Phe .ys Olu ASP Oly Asn He Leu lly His Lys Met Clu "^l .sn Tyr 
Asn se. His Asn VaX ryr Xle Met Ala Asp .ys P.o lyl .sn Cly lie 
.ys val Asn Phe .ys lie Ar. His Asn Xle Lys IZ oly Ser Val Oln 
Leu Ala Asp His Tyr Oln Cln Asn Thr Pro ^le' oly Asp Oly Pro ^al' 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Oln Ser Ala Leu III Lys 
ASP Pro Asn Olu Lys Arg Asp His Mel XXe Leu Leu OXu III VaX Thr 
Ala Ala Oly XXe T.r His OXy '.11 .sp OXu Leu ^r L^s Pro OXn OXu 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18X5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .X8XX 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

IS s° r r r °^ -° 

^ a Ala Ala Ala Ala Gly Pro Glu Met Val Arg Gly Gin Val 

10 15 

z s s r 

P val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr He Gly Glu Gly 

G^^ SIT SI S Z r r ^ - CGA 

35 ^ ^ '^■^^^ Asp Asn Leu Asn Lys Val Arg 

45 

^ SI j;:^ ^r. s !r r ™= 

Y i.ys lie ser Pro Phe Glu His Gin Thr Tyr Cys Gin 
^5 60 

S lIu Tu il'. 

65 ^ ''y^ Arg Phe Arg His Glu Asn 



48 



96 



144 



192 



240 
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ATC ATC GGC ATC AAT GAC ATC ATC CGG GCA CCA ACC ATT GAG GAG ATG 288 

lie lie Gly lie Asn Asp lie lie Arg Ala Pro Thr lie Glu Gin Met 
85 90 95 

5 

AAA GAT GTA TAT ATA GTA GAG GAC CTC ATG GAG ACA GAT CTT TAG AAG 336 

Lys Asp Val Tyr lie Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys 
100 105 110 

10 CTC TTG AAG ACA CAG CAC CTC AGC AAT GAT CAT ATC TGC TAT TTT CTT 384 
Leu Leu Lys Thr Gin His Leu Ser Asn Asp His lie Cys Tyr Phe Leu 
115 120 125 

TAT CAG ATC CTG AGA GGA TTA AAG TAT ATA CAT TCA GCT AAT GTT CTG 432 
15 Tyr Gin lie Leu Arg Gly Leu Lys Tyr lie His Ser Ala Asn Val Leu 
130 135 140 

CAC CGT GAC CTC AAG CCT TCC AAC CTC CTG CTG AAC ACC ACT TGT GAT 4 80 

His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr Cys Asp 
20 145 150 155 160 

CTC AAG ATC TGT GAC TTT GGC CTT GCC CGT GTT GCA GAT CCA GAC CAT 528 

Leu Lys lie Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro Asp His 
165 170 175 

25 

GAT CAT ACA GGG TTC TTG ACA GAG TAT GTA GCC ACG CGT TGG TAC AGA 576 

Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg 
180 185 190 

30 GCT CCA GAA ATT ATG TTG AAT TCC AAG GGT TAT ACC AAG TCC ATT GAT 624 
Ala Pro Glu lie Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser lie Asp 

195 200 205 

ATT TGG TCT GTG GGC TGC ATC CTG GCA GAG ATG CTA TCC AAC AGG CCT 672 
35 lie Trp Ser Val Gly Cys lie Leu Ala Glu Met Leu Ser Asn Arg Pro 
210 215 220 

ATC TTC CCA GGA AAG CAT TAC CTT GAC CAG CTG AAT CAC ATC CTG GGT 720 
lie Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His He Leu Gly 
40 225 230 235 240 

ATT CTT GGA TCT CCA TCA CAG GAA GAT CTG AAT TGT ATA ATA AAT TTA 768 
He Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys He He Asn Leu 
245 250 255 

45 

AAA GCT AGA AAC TAT TTG CTT TCT CTC CCG CAC AAA AAT AAG GTG CCG 816 
Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys Val Pro 
260 265 270 

50 TGG AAC AGG TTG TTC CCA AAC GCT GAC TCC AAA GCT CTG GAT TTA CTG 864 
Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp Leu Leu 
275 280 285 

GAT AAA ATG TTG ACA TTT AAC CCT CAC AAG AGG ATT GAA GTT GAA CAG 912 
55 Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg He Glu Val Glu Gin 
290 295 300 
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45 



50 
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124 

SI S til ™ - ca> o-. 

305 ^"''^ ''^^ ^^"^ Tyr Asp Pro Ser Asp Glu 

5 315 

s "I ^^i s ^ s= s= ^ s 

350 

15 Sn So S lyr S ler SI^ T p"^ ^^'^ 

^ J Tyr Arg Ser Met Asp Pro Pro Val Ala Thr Met Val Ser 

365 

2| 2^ S Z ^J." s S S S 

!5 "° 395 

S fi^ S ^£ ti;^.^^^-^- s ?S 

410 



960 



1008 



1056 



1104 



1152 



1200 



1248 



430 



35 Tly vlt Sn Cys SSe' sS T p'" 

435 ''''^ Jin ""^P ''^^ His Asp 

445 

TTC TTC AAG TCC GCC ATG rrn i-nA 

Phe Phe Lys Ser All mH Pro '''^^ ^-^^ ATC 

450 Val Gin Glu Arg Thr He 



460 



pJe SJs Sp S r -C 

y ASP Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 

475 

^ s z ?s s ^ r s s: s s= - 

490 

AAG GAG GAC GGC AAC ATT (-TV r^cr^ 

^v. o.. ..p - - - - - - «C J., ^.c 



1296 



1344 



1392 



1440 



1488 



1536 



55 C.O acc .Tc isa4 

y lie Met Ala Asp Lys Gin Lys Asn Gly He Lys 



520 
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GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG GAG CTC 1632 
Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 
530 535 540 

5 

GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG GTG 1680 
Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 
545 550 555 560 

10 CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC 1728 
Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 
565 570 575 

CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC 1776 
15 Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 
580 585 590 

GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AA GTAA 1815 
Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
20 595 600 



(2) INFORMATION FOR SEQ ID NO: 59: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



Met Ala Ala Ala Ala Ala Ala Gly Pro Glu Met Val Arg Gly Gin Val 

15 10 15 

Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr He Gly Glu Gly 
20 25 30 

40 Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Leu Asn Lys Val Arg 
35 40 45 

Val Ala He Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Gin 

50 55 60 

Arg Thr Leu Arg Glu He Lys He Leu Leu Arg Phe Arg His Glu Asn 
45 65 70 75 80 

He He Gly He Asn Asp He He Arg Ala Pro Thr He Glu Gin Met 

85 90 95 

Lys Asp Val Tyr He Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys 
100 105 110 

50 Leu Leu Lys Thr Gin His Leu Ser Asn Asp His He Cys Tyr Phe Leu 
115 120 125 

Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu 

130 135 140 

His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr Cys Asp 
55 145 150 155 160 

Leu Lys He Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro Asp His 
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165 



ASP His Thr Gly Phe Leu Thr Glu Tyr III Ala Thr Arg Trp Arg 

Ala Pro Glu lie Met Leu Asn Ser tyl oly Tyr Thr Lys III He Asp 

200 2fm 
lie Trp ser Val Gly Cys lie Leu Ala Glu Met Leu Ser Asn Arg Pro 

lie Phe Pro Gly Lys His ?Jr Leu Asp Gin Leu Asn His He Leu Gly 

He Leu Gly Ser Pro Ser Gin Glu Asp Leu lln Cys He He Asn III 



250 



Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys "l Pro 



265 



Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp Leu Leu 

280 285 
ASP Lys Met Leu Thr Phe Asn Pro His Lys Arg He Glu Val Glu Gin 

Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Isp Pro Ser Asp Glu 

3 15 

Pro He Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp Asp ^u" 
Pro Lys Glu Lys Leu Lys Glu Leu He Phe Glu Glu Thr Ala Arg Phe 
Oln Pro Gly Tyr Arg Ser Met Asp p" Pro Val Ala Thr mH val Ser 



360 



Lys Gly Glu Glu Leu Phe Thr Val Val Pro He Leu Val Glu Leu 



370 



375 



— • " 380 
ASP Gly ASP val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 



390 



Gly ASP Ala Thr ^r o'ly Lys Leu Thr Leu Lys Phe He Cys Thr Z 



405 



Gly Lys Leu Pro Val Pro Trp Pro Thr 111 Val Thr Thr Leu T^r Tyr 

Gly val Gin cys Phe Ser Arg ^r III Asp His Met Lys Gin His Asp 

Phe Phe Lys Ser Ala Met Pro 111 oly Tyr Val Gin gIu Arg Thr He 

4 55 460 
Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 

Clu Gly ASP Thr Leu Val Asn Arg He Glu til Lys Gly Xle Asp III 

Lys Glu ASP Oly Asn He Leu Gly His "s" Leu Glu Tyr Asn ^^r Asn 

ser His Asn Val Tyr He Met Ala As°p Lys Gin Lys Asn Gly He Lys 

520 59"; 
val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

Ala ASP His Tyr Gin Gin Asn Thr Pro He Gly ^ Gly Pro Val Leu 

Leu Pro ASP Asn His Tyr Leu Ser Thr Gin Se" Ala Leu Ser Lys As'p 

Pro Asn Glu Lys Arg Asp His Met Val III Leu Glu Phe Val Thr Ala 

Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys "° 



600 
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(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2511 base pairs 

5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: cDNA 
10 <ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2508 
(D) OTHER INFORMATION: 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



ATG GAG CTG GAA AAC ATC GTG GCC AAC ACG GTC TTG CTG AAA GCC AGG 4 8 

Met Glu Leu Glu Asn lie Val Ala Asn Thr Val Leu Leu Lys Ala Arg 
20 1 5 10 15 

GAA GGG GGC GGA GGA AAG CGC AAA GGG AAA AGC AAG AAG TGG AAA GAA 96 
Glu Gly Gly Gly Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu 
20 25 30 

25 

ATC CTG AAG TTC CCT CAC ATT AGC CAG TGT GAA GAC CTC CGA AGG ACC 144 
lie Leu Lys Phe Pro His lie Ser Gin Cys Glu Asp Leu Arg Arg Thr 
35 40 45 

30 ATA GAC AGA GAT TAG TGC AGT TTA TGT GAC AAG CAG CCA ATC GGG AGG 192 
lie Asp Arg Asp Tyr Cys Ser Leu Cys Asp Lys Gin Pro lie Gly Arg 
50 55 60 

CTG CTT TTC CGG CAG TTT TGT GAA ACC AGG CCT GGG CTG GAG TGT TAC 24 0 

35 Leu Leu Phe Arg Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr 
65 70 75 80 

ATT CAG TTC CTG GAC TCC GTG GCA GAA TAT GAA GTT ACT CCA GAT GAA 288 
lie Gin Phe Leu Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu 
40 85 90 95 

AAA CTG GGA GAG AAA GGG AAG GAA ATT ATG ACC AAG TAC CTC ACC CCA 336 

Lys Leu Gly Glu Lys Gly Lys Glu lie Met Thr Lys Tyr Leu Thr Pro 
100 105 110 

45 

AAG TCC CCT GTT TTC ATA GCC CAA GTT GGC CAA GAC CTG GTC TCC CAG 384 

Lys Ser Pro Val Phe lie Ala Gin Val Gly Gin Asp Leu Val Ser Gin 

115 120 125 

50 ACG GAG GAG AAG CTC CTA CAG AAG CCG TGC AAA GAA CTC TTT TCT GCC 432 
Thr Glu Glu Lys Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala 
130 135 140 

TGT GCA CAG TCT GTC CAC GAG TAC CTG. AGG GGA GAA CCA TTC CAC GAA 4 80 

55 Cys Ala Gin Ser Val His Glu Tyr Leu Arg Gly Glu Pro Phe His Glu 
145 150 155 160 
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TAT CTG GAC AGC ATG TTT ttt r-Ao r^r^r^ rr.r^^ 

... ... S S S o'J^ S S ™ 

"5 170 

zi iTn III ei? e r s:^ '^^^ 

180 ^ ^'^Sr Gin Tyr Arg Val Leu 



185 



190 



S ^ S V:. SI S S.^ 12 S S 



205 



z SI s s iz z z s s ^. 

220 

i S S 22 S Z X X ^ 

"° 235 240 

tit ill Z r °f °- ^ 



250 



255 



S SI HI ?S Z S ^T. 

270 



285 



Z Iti Su r 

2 90 Asp Leu His Arg 



300 



Tu ?S ?S IT, zi r r 

vai Tyr Arg Asp Leu Lys Pro GIu Asn He Leu Leu Asp 

320 

ASP Sr Sy S I'" ^^"^ "° 

yr Giy Has lie Arg He Ser Asp Leu Gly Leu Ala Val Lys He 

330 



345 



350 



Itl r JJ*^ r "'^ ""^ 

355 ^^"^ ^In Arg Tyr Gly Leu Ser Pro Asp Tyr 



365 



52 8 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 
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TTC CGC GGC CGT AAG GAG AAG GTG AAG CGG GAG GAG GTG GAG CGC CGG 12 00 

Phe Arg Gly Arg Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg 
385 390 395 400 

5 

GTC CTG GAG ACG GAG GAG GTG TAG TCC CAC AAG TTC TCC GAG GAG GCC 1248 

Val Leu Glu Thr Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala 
405 410 415 

10 AAG TCC ATC TGC AAG ATG CTG CTC ACG AAA GAT GCG AAG CAG AGG CTG 1296 
Lys Ser lie Cys Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu 
420 425 430 

GGC TGC CAG GAG GAG GGG GCT GCA GAG GTC AAG AGA CAC CCC TTC TTC 1344 
15 Gly Cys Gin Glu Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe 
435 440 445 

AGG AAC ATG AAC TTC AAG CGC TTA GAA GCC GGG ATG TTG GAC CCT CCC 1392 
Arg Asn Met Asn Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro 
20 450 455 460 

TTC GTT CCA GAC CCC CGC GCT GTG TAC TGT AAG GAC GTG CTG GAC ATC 144 0 

Phe Val Pro Asp Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp lie 
465 470 475 480 

25 

GAG CAG TTC TCC ACT GTG AAG GGC GTC AAT CTG GAC CAC ACA GAC GAC 1488 

Glu Gin Phe Ser Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp 

485 490 495 

30 GAC TTC TAC TCC AAG TTC TCC ACG GGC TCT GTG TCC ATC CCA TGG CAA 1536 
Asp Phe Tyr Ser Lys Phe Ser Thr Gly Ser Val Ser lie Pro Trp Gin 
500 505 510 

AAC GAG ATG ATA GAA ACA GAA TGC TTT AAG GAG CTG AAC GTG TTT GGA 1584 
35 Asn Glu Met He Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly 
515 520 525 

CCT AAT GGT ACC CTC CCG CCA GAT CTG AAC AGA AAC CAC CCT CCG GAA 1632 
Pro Asn Gly Thr Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu 
40 530 535 540 

CCG CCC AAG AAA GGG CTG CTC CAG AGA CTC TTC AAG CGG CAG CAT CAG 1680 

Pro Pro Lys Lys Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin 

545 550 555 560 

45 

AAC AAT TCC AAG AGT TCG CCC AGC TCC AAG ACC AGT TTT AAC CAC CAC 172 8 

Asn Asn Ser Lys Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His 
565 570 575 

50 ATA AAC TCA AAC CAT GTC AGC TCG AAC TCC ACC GGA AGC AGC AGG GAT 1776 
He Asn Ser Asn His Val Ser Ser Asn Ser Thr Gly Ser Ser Arg Asp 
580 585 590 

CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 1824 
55 Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
595 600 605 
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50 



^ iT. ^ ^ z v£ ^. 

i v.^ s s zi s 5 s 

635 



650 



655 



665 



670 



2.= ^. X S S= S „»S Z 



680 



685 



745 



750 



z z". i^i z ji: - 5- - - - - - 



Z S X S IS 



45 "^"^^ 780 



pro ne' lly Z Gly Pro ^I? 

Giy Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 



795 



800 



?hr m n ^^^'^ "° "'^'^ '^^ GAG AAG CGC GAT CAC ATG 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Sp Sis iTt 

810 

^.^.X^ 5™ S IL^ IS S tl^ ?S S si! 



1872 



1920 



1968 



830 



2064 



2112 



2160 



^ s X X s s z 

25 700 

S X l^^.T^X^,z ?S S -I S 

"° 715 

30 ATC GAG CTG AAG GGC ATC GAG TTC AAQ GAG GAC GGC AAC ATC CTG GGG 
lie Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly lit Ty 

730 

35 ^"""^ '^'^ AAC GTC TAT ATC ATG GCC 2,^« 

oJ> Has Lys Leu Glu Tvr Asn tv>- a^.^ o tt • 2256 

740 ^^'^ Met Ala 



2208 



2304 



2352 



2400 



2448 
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GAG CTG TAG AAG TAA 2511 
Glu Leu Tyr Lys 
835 



(2) INFORMATION FOR SEQ ID NO;61: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 836 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

20 Met Glu Leu Glu Asn lie Val Ala Asn Thr Val Leu Leu Lys Ala Arg 
15 10 15 

Glu Gly Gly Gly Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu 

20 25 30 

lie Leu Lys Phe Pro His lie Ser Gin Cys Glu Asp Leu Arg Arg Thr 
25 35 40 45 

lie Asp Arg Asp Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg 

50 55 60 

Leu Leu Phe Arg Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr 
65 70 75 80 

30 He Gin Phe Leu Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu 

85 90 95 

Lys Leu Gly Glu Lys Gly Lys Glu He Met Thr Lys Tyr Leu Thr Pro 

100 105 110 

Lys Ser Pro Val Phe He Ala Gin Val Gly Gin Asp Leu Val Ser Gin 
35 115 120 125 

Thr Glu Glu Lys Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala 

130 135 140 

Cys Ala Gin Ser Val His Glu Tyr Leu Arg Gly Glu Pro Phe His Glu 
145 150 155 160 

40 Tyr Leu Asp Ser Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu 

165 170 175 

Glu Arg Gin Pro Val Thr Lys Asn Thr Phe Arg Gin Tyr Arg Val Leu 

180 185 190 

Gly Lys Gly Gly Phe Gly Glu Val Cys Ala Cys Gin Val Arg Ala Thr 
45 195 200 205 

Gly Lys Met Tyr Ala Cys Lys Arg Leu Glu Lys Lys Arg He Lys Lys 

210 215 220 

Arg Lys Gly Glu Ser Met Ala Leu Asn Glu Lys Gin He Leu Glu Lys 
225 230 235 240 

50 Val Asn Ser Gin Phe Val Val Asn Leu Ala Tyr Ala Tyr Glu Thr Lys 

245 250 255 

Asp Ala Leu Cys Leu Val Leu Thr He Met Asn Gly Gly Asp Leu Lys 

260 265 270 

Phe His He Tyr Asn Met Gly Asn Pro Gly Phe Glu Glu Glu Arg Ala 
55 275 280 285 

Leu Phe Tyr Ala Ala Glu He Leu Cys Gly Leu Glu Asp Leu His Arg 
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290 295 



Glu Asn Thr Val Tyr Arg Asp Leu Lys Pro GIu Asn lie Leu Leu Asp 



310 -^Tc 

320 



ASP Tyr Gly His lie Arg He Ser Asp Leu Gly i.eu Ala Val Lys III 



330 



Pro Glu Gly ASP Leu He Arg Gly Arg Val Gly Thr Val Gly Tyr Met 

345 

Ala Pro Glu Val Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr 

365 

Trp Gly Leu Gly Cys Leu lie Tyr Glu Met lie Glu Gly Gin Ser Pro 
Phe Arg Gly Arg Lys Glu lys Val Lys Arg Glu Glu Val Asp Arg Arg 



val Leu Glu Thr Glu Glu Val Tyr Ser His ^ys Phe Ser Glu Glu til 



400 



"0 415 



Lys ser He Cys Lys Met Leu Leu Thr Lys Asp Ala Lys Gin l^g Leu 

425 

Gly cys Gin Glu Glu Gly Ala Ala Glu Val Lys Arg His "o Phe Phe 



Arg Asn Met Asn Phe Lys Arg Z^G Glu Ala Gly Met "u Asp Pro Pro 



455 



Phe val Pro Asp Pro Arg Al'I Val Tyr Cys Lys Asp Val Leu Asp He 

Glu Gin Phe ser Thr Val Lys Gly Val Asn III Asp His Thr Asp L'p 

490 

ASP Phe Tyr Ser Lys Phe Ser Thr Gly Ser Val Ser He Pro Jrp Gin 



505 



Asn Glu Met He Glu Thr Glu cys Phe Lys Glu Leu Asn vli Phe Gly 

Pro Asn Gly Thr Leu Pro Pro ^p Leu Asn Arg Asn His Pro Pro Glu 

Pro Pro Lys Lys Gly Leu Leu Gin Arg Leu Phe Itl Arg Gin His Gin 

Asn Asn Ser Lys Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His h!s 

lie Asn ser Asn His Val Ser Ser Asn llr Thr Gly Ser Ser III Asp 

585 



Pro Pro val Ala Thr Met Val Ser Lys Gly Glu Glu Leu III Thr Gly 

600 605 

val val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

620 

Phe ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr -Tyr Gly Lys Leu 
Thr Leu Lys Phe He Cys Thr Thr Gly Lys III Pro Val Pro Trp "o 



Thr Leu val Thr Thr Leu Thr Tyr Gly til Gin Cys Phe Ser "g Tyr 



665 



Pro ASP His Met Lys Gin His Asp Phe Phe Lys Ser Ala MeJ Pro Glu 

680 goc 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn ^r 

695 700 
I.ys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

lie Glu Leu Lys Gly He Asp Phe Lys Glu III Gly Asn He Leu Gil 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He III Ala 
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740 745 750 

Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 

755 760 765 

lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
5 770 775 780 

Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
785 790 795 800 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
805 810 815 

10 Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp 
820 825 830 

Glu Leu Tyr Lys 
835 

15 <2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHZUIACTERISTICS : 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
<ix) FEATURE: 

25 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1890 
(D) OTHER INFORMATION: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

ATG AGC AGA AGC AAG CGT GAC AAC AAT TTT TAT AGT GTA GAG ATT GGA 48 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu lie Gly 

15 10 15 

35 

GAT TCT ACA TTC AC A GTC CTG AAA CGA TAT CAG AAT TTA AAA CCT ATA 96 

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro lie 

20 25 30 

40 GGC TCA GGA GCT CAA GGA ATA GTA TGC GCA GCT TAT GAT GCC ATT CTT 144 

Gly Ser Gly Ala Gin Gly lie Val Cys Ala Ala Tyr Asp Ala lie Leu 
35 40 45 

GAA AGA AAT GTT GCA ATC AAG AAG CTA AGC CGA CCA TTT CAG AAT CAG 192 
45 Glu Arg Asn Val Ala lie Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 
50 55 60 

ACT CAT GCC AAG CGG GCC TAC AGA GAG CTA GTT CTT ATG AAA TGT GTT 24 0 

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
50 65 70 75 80 

AAT CAC AAA AAT ATA ATT GGC CTT TTG AAT GTT TTC ACA CCA CAG AAA 2 88 

Asn His Lys Asn lie lie Gly Leu Leu Asn Val Phe Thr Pro Gin Lys 
85 90 95 



55 



TCC CTA GAA GAA TTT CAA GAT GTT TAC ATA GTC ATG GAG CTC ATG GAT 336 
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50 



WO 98/45704 

PCT/DK98/00145 

134 

Ser Leu Glu Glu Phe Gin Asp Val Tyr He Val Met Glu Leu Met Asp 



GCA AAT CTT TGC CAA GTG ATT CAG ATG GAG CTA GAT CAT GAA ann »^ 
5 Ala Asn .eu Cys Gin Val xle Gin Met Glu asJ X ^T. 

120 

TCC TAC CTT CTC TAT CAG ATG CTC TGT GGA ATC AAG CAC CTT CAT TCT 

10 To 



140 



TGG TCT GTG GGG TGC ATT ATr: nna r-ny^ «m« 

, ^^^-^ ATG GTT TGC CAC AAA ATr r-Tr- 

Xrp ser Val Gly Cys He Met Gly Glu Met Val Cys His ^ Til Su 



220 



TTT CCA GGA AGG GAC TAT ATT GAT CAG TGG AAT AAA GTT ATT GAA CAG 
Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn I.ys Val He Sn 
35 "° 235 240 

7Z n?"" '"^'^ ATG AAG AAA CTG CAA CCA ACA GTA 

I-eu Gly Thr Pro Cys Pro Glu Phe Met I.ys Lys Leu pS rTr Tl 



245 250 255 



Jrg Jhr Sr 511 gJ^ T f ^ ^ '^^^ ^^^^ 

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 

260 265 



270 



Su A?a ser S^n Sa ^ f ^ '''''' ^ ^ ^TA ATA 

290 wet Leu Val He 



295 



300 



ASP S ^ "^^^ °CT CTC CAA CAC CCG TAC 

305 ''''^ "! ^^'^ Ala Leu Gin His Pro Tyr 

55 315 320 



384 



432 



a" S f ■ ! f ° '^^'^ AGT AAT ATA GTA GTA AAA 

Ala Gly He He Has Arg Asp Leu Lys Pro Ser Asn He Val Val Lys 

15 ^° "5 160 

TCT GAT TGC ACT TTG AAG ATT CTT GAC TTC GGT CTG GCC AGG ACT GCA 

ser ASP cys Thr Leu Lys He Leu Asp Phe Gly Leu Ixl Arg JSr Sa 
165 170 

GGA ACG AGT TTT ATG ATG ACG CCT TAT GTA GTG ACT CGC TAC TAC AGA 
Gly Thr ser Phe Met Met Thr Pro Tyr Val Val Thr Arg ^r ^r Z 
"° 185 190 

200 205 



480 



528 



576 



672 



720 



768 



816 



280 285 



912 



960 



1008 
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He Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Pro Pro Pro Lys 
325 330 335 

ATC CCT GAC AAG CAG TTA GAT GAA AGG GAA CAC ACA ATA GAA GAG TGG 1056 
5 lie Pro Asp Lys Gin Leu Asp Glu Arg Glu His Thr lie Glu Glu Trp 
340 345 350 

AAA GAA TTG ATA TAT AAG GAA GTT ATG GAC TTG GAG GAG AGA ACC AAG 1104 
Lys Glu Leu He Tyr Lys Glu Val Met Asp Leu Glu Glu Arg Thr Lys 
10 355 360 365 

AAT GGA GTT ATA CGG GGG CAG CCC TCT CCT TTA GCA CAG GTG CAG CAG 1152 

Asn Gly Val He Arg Gly Gin Pro Ser Pro Leu Ala Gin Val Gin Gin 
370 375 380 

15 

TGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC 1200 

Trp Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
385 390 395 400 

20 ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC 1248 
Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 
405 410 415 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC 1296 
25 His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
420 425 430 

AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 1344 
Lys Leu Thr Leix Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 
30 435 440 445 

TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC 13 92 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
450 455 460 

35 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 1440 
Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
465 470 475 480 

40 CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 1488 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 
485 490 495 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 1536 
45 Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
500 505 510 

AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 1584 
Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 
50 515 520 525 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC 1632 
Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
530 535 540 



55 



ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC 1680 
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Asn Gly He 

550 



Met Ala Asp Lys Gin Lys Asn Gly He Lvs v^l i,^^ x 
545 ^ ^^-^ Asn Phe Lys He Arg 

560 

S S JJ^ S S - 



575 



Z S ?S Z ^ S S 2.^ Z T. 



590 



s^;.' ir. s° r r °- 

535 ^^"^ f^^ l-ys Asp Pro Asn Glu Lys Arg Asp 

15 "° 605 

S S S S S.^ 2?= S= .™ S 

^■^^ 620 



625 



630 



(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 630 amino acids 
<B) TYPEr amino acid 
tC) STRANDEDNESS : single 
(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



Met ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu He Gly 
40 ASP ser Thr P.e Thr Val Leu Lys Arg ^ Gin Asn Leu Lys Pro He 
^ Oly ser Gly Ala Gin Gly He Val Cys Ala Ala lyr Asp IL He Leu 



Glu Arg Asn Val Ala He Lys Jys Leu Ser Arg Pro Phe Gin Asn Gin 
Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
Asn His Lys Asn He He Gly Leu Leu Asn 111 Phe Thr P.o Gin L^s 
50 ser Leu Glu Glu Phe Gin Asp Val ^r II Val Met Olu Leu M^t Asp 
Ala Asn Leu Cys Gin Val He Gin III Glu Leu Asp His cll Arg Met 
55 7,1 -u cys Gly xie Lys L's Leu His Ser 

Ala Gly He He His Arg Asp Leu Lys Pro Ser 12 He Val Val Lys 



1728 



1776 



1824 



1872 



ATG GAC GAG CTG TAC AAG TAA 

Met Asp Glu Leu Tyr Lys 1893 
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145 150 155 160 

Ser Asp Cys Thr Leu Lys lie Leu Asp Phe Gly Leu Ala Arg Thr Ala 

165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 
5 180 185 190 

Ala Pro Glu Val lie Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 

195 200 205 

Trp Ser Val Gly Cys He Met Gly Glu Met Val Cys His Lys He Leu 
210 215 220 

10 Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin 
225 230 235 240 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 

245 250 255 

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 
15 260 265 270 

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 

275 280 285 

Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val He 
290 295 300 

20 Asp Ala Ser Lys Arg He Ser Val Asp Glu Ala Leu Gin His Pro Tyr 
305 310 315 320 

He Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Pro Pro Pro Lys 

325 330 335 

He Pro Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu Trp 
25 340 345 350 

Lys Glu Leu He Tyr Lys Glu Val Met Asp Leu Glu Glu Arg Thr Lys 

355 360 365 

Asn Gly Val He Arg Gly Gin Pro Ser Pro Leu Ala Gin Val Gin Gin 
370 375 380 

30 Trp Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
385 390 395 400 

Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

405 410 415 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
35 420 425 430 

Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 

435 440 445 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
450 455 460 

40 Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
465 470 475 480 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 

485 490 495 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
45 500 505 510 

Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 

515 520 525 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
530 535 540 

•50 Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
545 550 555 560 

His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

565 570 575 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
55 580 585 590 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
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600 



His Met val X,eu Leu Glu Phe VaX Thr Ala Ala Gly lie Thr .eu Gly 
Met Asp Glu Leu Tyr Lys 



625 



630 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1818 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

25 Ill oTn S Pro iir lit ™" ^ ^ " 

^ n <^Lu Arg Pro Thr Phe Tyr Arg Gin Glu Leu Asn Lys Thr 



10 



xTe 111 vll p'''' T """^ ^^'^ -""^ ^-^G TCT 

lie Trp Glu val Pro Glu Arg Tyr Gin Asa Leu Ser Pro Val Gly Ser 



25 



30 



Tly lit ^r Sv Sr vTl T ^ ^'^^ ^ 

ly Ala ^r Gly Ser Val Cys Ala Ala Phe Asp Thr Lys Thr Gly Leu 

35 ^° 4S 

s SI? s ie: r r 

9 ^al Ala Val Lys Lys Leu Ser Arg Pro Phe Gin Ser He He His 

GCG AAA AGA ACC TAC AGA GAA CTG CGG TTA CTT AAA CAT ATG AAA CAT 
Ala Lys Arg Thr Tyr Arg Glu Leu Arg Leu Leu Lys His Sit uTs 



75 80 



85 



CTG AAC AAC ATT GTQ AAA TGT CAG AAG CTT ACA GAT GAG CAT GTT CAG 
Leu Asn Asn He Val Lys Cys Gin Lys Leu Thr Asp Asp nil vll SJn 
55 125 

TTC CTT ATC TAC CAA ATT CTC CGA GGT CTA AAG TAT ATA CAT TCA GCT 



96 



144 



192 



240 



336 



384 



432 
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Phe Leu lie Tyr Gin lie Leu Arg Gly Leu Lys Tyr He His Ser Ala 
130 X35 140 

GAC ATA ATT CAC AGO GAC CTA AAA CCT AGT AAT CTA GCT GTG AAT GAA 4 80 

5 Asp He He His Arg Asp Leu Lys Pro Ser Asn Leu Ala Val Asn Glu 
145 150 155 160 

GAC TGT GAG CTG AAG ATT CTG GAT TTT GGA CTG GCT CGG CAC ACA GAT 528 
Asp Cys Glu Leu Lys He Leu Asp Phe Gly Leu Ala Arg His Thr Asp 
10 165 170 175 

GAT GAA ATG ACA GGC TAG GTG GCC ACT AGG TGG TAG AGG GCT CCT GAG 576 

Asp Glu Met Thr Gly Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu 

180 185 190 

15 

ATC ATG CTG AAC TGG ATG CAT TAG AAC CAG ACA GTT GAT ATT TGG TCA 624 

He Met Leu Asn Trp Met His Tyr Asn Gin Thr Val Asp He Trp Ser 
195 200 205 

20 GTG GGA TGC ATA ATG GCC GAG CTG TTG ACT GGA AGA ACA TTG TTT CCT 672 
Val Gly Cys He Met Ala Glu Leu Leu Thr Gly Arg Thr Leu Phe Pro 
210 215 220 

GGT ACA GAC CAT ATT GAT CAG TTG AAG CTC ATT TTA AGA CTC GTT GGA 720 
25 Gly Thr Asp His He Asp Gin Leu Lys Leu He Leu Arg Leu Val Gly 
225 230 235 240 

ACC CCA GGG GCT GAG CTT TTG AAG AAA ATC TCC TCA GAG TCT GCA AGA 768 
Thr Pro Gly Ala Glu Leu Leu Lys Lys He Ser Ser Glu Ser Ala Arg 
30 245 250 255 

AAC TAT ATT CAG TCT TTG ACT CAG ATG CCG AAG ATG AAC TTT GCG AAT 816 
Asn Tyr He Gin Ser Leu Thr Gin Met Pro Lys Met Asn Phe Ala Asn 
260 265 270 

35 

GTA TTT ATT GGT GCC AAT CCC CTG GCT GTC GAC TTG CTG GAG AAG ATG 864 
Val Phe He Gly Ala Asn Pro Leu Ala Val Asp Leu Leu Glu Lys Met 
275 280 285 

40 CTT GTA TTG GAC TCA GAT AAG AGA ATT ACA GCG GCC CTVA GCC CTT GCA 912 
Leu Val Leu Asp Ser Asp Lys Arg He Thr Ala Ala Gin Ala Leu Ala 
290 295 300 

CAT GCC TAC TTT GCT CAG TAC CAC GAT CCT GAT GAT GAA CCA GTG GCC 960 
45 His Ala Tyr Phe Ala Gin Tyr His Asp Pro Asp Asp Glu Pro Val Ala 
305 310 315 320 

GAT CCT TAT GAT CAG TCC TTT GAA AGC AGG GAC CTG CTT ATA GAT GAG 1008 
Asp Pro Tyr Asp Gin Ser Phe Glu Ser Arg Asp Leu Leu He Asp Glu 
50 325 330 335 

TGG AAA AGC CTG ACC TAT GAT GAA GTC ATC AGC TTT GTG CCA CCA CCC 1056 
Trp Lys Ser Leu Thr Tyr Asp Glu Val He Ser Phe Val Pro Pro Pro 
340 345 350 



55 



CTT GAC CAA GAA GAG ATG GAG TCC GAG GAT CCA CCG GTC GCC ACC ATG 1104 
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Leu ASP Gin Glu Glu Met Glu Ser Glu Asp Pro Pro Val Ala Thr Met 



360 



365 



GTG AGC AAG GGC GAG GAG CTG TTC Ann nnn r.rr^ « 

ir^-t ««V3 lit, ACC GGG GTG GTG CCC ATO nTn o-r/- 

val ser Lys Gly Glu Glu Leu Phe Thr Gly Val vl? Pro ?Ie lI^ 



375 380 



GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 
^ Glu Leu ASP Gly Asp Val Asn Gly His Lys Phe Ser vll Sr Sy Su 

395 

5 410 415 

?s g s ^= s s s ?s ?s s 

430 

r s s s z s S ^ 

T^^ ^. L- J- s ?s s= tr. z 



460 



?hr ^le Ph» ir '^'^ '^^'^ ^^'^ ™^ CGC GCC GAG GTG 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Olu vTl 

"° 475 
490 

510 

^ ;s ™' r "° 

vax Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
js val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val 
C^n L^L' O^l X r ^^'^ 

545 " T^*^ lie Gly Asp Gly Pro 

"° 555 

l'Iu Su Pro As^ r J^"' '^^^ ^'^^ 

i-eu Pro Asp Asn Has Tyr Leu Ser Thr Gin Ser Ala Leu Ser 



565 



570 



575 



AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CIXS GAG TTC GTG 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 
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Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

580 585 590 

ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TAA X821 

Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 605 



(2) INFORMATION FOR SEQ ID NO: 65: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Met Ser Gin Glu Arg Pro Thr Phe Tyr Arg Gin Glu Leu Asn Lys Thr 

15 10 15 

lie Trp Glu Val Pro Glu Arg Tyr Gin Asn Leu Ser Pro Val Gly Ser 
25 20 25 30 

Gly Ala Tyr Gly Ser Val Cys Ala Ala Phe Asp Thr Lys Thr Gly Leu 

35 40 45 

Arg Val Ala Val Lys Lys Leu Ser Arg Pro Phe Gin Ser lie lie His 
50 55 60 

30 Ala Lys Arg Thr Tyr Arg Glu Leu Arg Leu Leu Lys His Met Lys His 
65 70 75 80 

Glu Asn Val lie Gly Leu Leu Asp Val Phe Thr Pro Ala Arg Ser Leu 

85 90 95 

Glu Glu Phe Asn Asp Val Tyr Leu Val Thr His Leu Met Gly Ala Asp 
35 100 105 110 

Leu Asn Asn lie Val Lys Cys Gin Lys Leu Thr Asp Asp His Val Gin 

115 120 125 

Phe Leu lie Tyr Gin lie Leu Arg Gly Leu Lys Tyr lie His Ser Ala 
130 135 140 

40 Asp lie lie His Arg Asp Leu Lys Pro Ser Asn Leu Ala Val Asn Glu 
145 150 155 160 

Asp Cys Glu Leu Lys lie Leu Asp Phe Gly Leu Ala Arg His Thr Asp 

165 170 175 

Asp Glu Met Thr Gly Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu 
45 180 185 190 

lie Met Leu Asn Trp Met His Tyr Asn Gin Thr Val Asp lie Trp Ser 

195 200 205 

Val Gly Cys lie Met Ala Glu Leu Leu Thr Gly Arg Thr Leu Phe Pro 
210 215 220 

50 Gly Thr Asp His lie Asp Gin Leu Lys Leu He Leu Arg Leu Val Gly 
225 230 235 240 

Thr Pro Gly Ala Glu Leu Leu Lys Lys He Ser Ser Glu Ser Ala Arg 

245 250 255 

Asn Tyr He Gin Ser Leu Thr Gin Met Pro Lys Met Asn Phe Ala Asn 
55 260 265 270 

Val Phe He Gly Ala Asn Pro Leu Ala Val Asp Leu Leu Glu Lys Met 
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275 



260 



.eu val .eu Asp ser Asp Lys Ar^ lie XHr Ala Ala oin Ala Leu Ala 
His Ala ryr Phe Ala Gin Tyr His Asp P.o Asp Isp Glu Pro Val Ala 

ASP Pro Xyr Asp Gin Ser Phe Glu Ser Arg Isp Leu Leu He Asp G^^ 

330 

Trp Lys Ser Leu Thr Tyr Asp Glu Val He Ser Phe Val Pro Pro Pro 

■ Leu Asp Gin Glu Glu Met Glu Ser g" Asp Pro Pro Val HI Thr Met 

val ser Lys Gly Glu Glu Leu Itl Thr Gly Val Val Pro He Leu Val 

Glu Leu ASP Gly Asp Val As" Gly His Lys Phe Se^ Val Ser Gly Glu 

jyO 395 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He "s 

Thr Thr Gly Lys Leu Pro Val Pro Trp til Thr Leu Val Thr HI Leu 

Thr Tyr Gly Val Gin Cys Phe Ser 111 Tyr Pro Asp His ^11 Lys Gin 

His ASP Phe Phe Lys Ser Ala ttl Pro Glu Gly Tyr ill Gin Glu Arg 

Thr He Phe Phe Lys Asp Isp Gly Asn Tyr Lys ill Arg Ala Glu Val 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg h" gIu Leu Lys Gly ll': 

490 

ASP Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu ^yr Asn 

Tyr Asn ser His Asn Val Tyr He Met Ala Asp Lys Gin L^s Asn Gly 

He Lys val Asn Phe Lys lie Irg His Asn He Glu 111 Gly Ser Val 

Gin Leu Ala Asp His Tyr "n Gin Asn Thr Pro He Gly Asp Gly Pro 

val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu ler 

570 

Lys Asp Pro Asn. Glu Lys Arq asd His M^i- m^i r . 

QOrt ^ ^eu Glu Phe Val 

585 



Thr Ala Ala Gly He Thr Leu Gly ^ll ^sp Glu Leu Tyr Lys 

605 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2910 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

ATG AGT GOT GAG GGG TAG CAG TAG AGA GCG CTG TAT GAT TAT AAA AAG 4 8 

5 Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr Asp Tyr Lys Lys 
15 10 15 

GAA AGA GAA GAA GAT ATT GAC TTG CAC TTG GGT GAC ATA TTG ACT GTG 96 
Glu Arg Glu Glu Asp lie Asp Leu His Leu Gly Asp He Leu Thr Val 
10 20 25 30 

AAT AAA GGG TCC TTA GTA GCT CTT GGA TTC AGT GAT GGA CAG GAA GCC 144 
Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp Gly Gin Glu Ala 
35 40 45 

15 

AGG CCT GAA GAA ATT GGC TGG TTA AAT GGC TAT AAT GAA ACC ACA GGG 192 
Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn Glu Thr Thr Gly 
50 55 60 

20 GAA AGG GGG GAC TTT CCG GGA ACT TAC GTA GAA TAT ATT GGA AGG AAA 24 0 

Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr He Gly Arg Lys 
65 70 75 80 

AAA ATC TCG CCT CCC ACA CCA AAG CCC CGG CCA CCT CGG CCT CTT CCT 288 
25 Lys He Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro 

85 90 95 

GTT GCA CCA GGT TCT TCG AAA ACT GAA GGA GAT GTT GAA CAA CAA GCT 336 
Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val Glu Gin Gin Ala 
30 100 105 110 

TTG ACT CTC CCG GAT CTT GCA GAG CAG TTT GCC CCT CCT GAC ATT GCC 3 84 

Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro Pro Asp He Ala 
115 120 125 

35 

CCG CCT CTT CTT ATC AAG CTC GTG GAA GCC ATT GAA AAG AAA GGT CTG 4 32 

Pro Pro Leu Leu He Lys Leu Val Glu Ala He Glu Lys Lys Gly Leu 
130 135 140 

40 GAA TGT TCA ACT CTA TAC AGA ACA CAG AGC TCC AGC AAC CTG GCA GAA 480 

Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser Asn Leu Ala Glu 
145 150 155 160 

TTA CGA CAG CTT CTT GAT TGT GAT ACA CCC TCC GTG GAC TTG GAA ATG 528 
45 Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val Asp Leu Glu Met 

165 170 175 

ATC GAT GTG CAC GTT TTG GCT GAC GCT TTC AAA CGC TAT CTC CTG GAC 576 
He Asp Val His Val Leu Ala Asp Ala Phe Lys Arg Tyr Leu Leu Asp 
50 180 185 190 

TTA CCA AAT CCT GTC ATT CCA GCA GCC GTT TAC AGT GAA ATG ATT TCT 624 
Leu Pro Asn Pro Val He Pro Ala Ala Val Tyr Ser Glu Met He Ser 
195 200 205 



55 
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Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr He Gin Leu Leu Lys 

215 220 

i S JI^ s s s s s s s z s ?s s 

23S 240 

CAG TAT TTG TTA AAA CAT TTC TTC AAG CTC TCT CAA ACC TCC ACr iviv» 
Gin ryr X.eu Leu .ys His Phe Phe Lys Leu Ser G^^ I^r S« 

^" 250 255 

AAT CTG TTG AAT GCA AGA GTA CTC TCT GAA ATT TTC AGC OCT ATG CTT 
Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe sir Pro Set Su 
260 265 270 

J^g III aS aS r r 

275 "'^'^ 1-6" He Lys 

280 285 

?S lie ?n CAG CCT GCA 

val lie Glu He Leu He Ser Thr Glu Trp Asn Glu Arg Gin Pro Ala 

300 

CCA GCA CTG CCT CCT AAA CCA CCA AAA CCT ACT ACT GTA GCC AAC AAC 
Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr Val A^^ 

z s - s s: z tr. ^ - s 

325 330 

GAT ATC TCG AGG GAA GAA GTG AAT GAA AAA CTT C6A GAT ACA GCA GAC 
ASP He ser Arg Glu Glu Val Asn Glu Lys Leu Arg Asp Asp 

?S ^ S Z - S Z Z Z 

360 3g5 

ACT CTT ACA CTA AGG AAA GGG GGA AAT AAC AAA TTA ATC AAA ATA TTT 
Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu He He pS 

^■'S 380 

CAT CGA GAT GGG AAA TAT GGC TTC TCT GAC CCA TTA ACC TTC AGT TCT 
H.S Arg ASP Gly Lys ^r Gly Phe Ser Asp Pro Leu Jhr 7ut Ser s" 

vl? He r ^^'^ <^^-^ CAG TAT 

val val Glu Leu He Asn His Tyr Arg Asn Glu Ser Leu Ala Gin Tyr 

" 410 

425 

CAG GAT CAA GTT GTC AAA GAA GAT AAT ATT GAA GCT GTA GGG AAA AAA 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 
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Gin Asp Gin Val Val Lys Glu Asp Asn lie Glu Ala Val Gly Lys Lys 

435 440 445 

TTA CAT GAA TAT AAC ACT CAG TTT CAA GAA AAA AGT CGA GAA TAT GAT 1392 
5 Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser Arg Glu Tyr Asp 
450 455 460 

AGA TTA TAT GAA GAA TAT AGO CGC ACA TCC CAG GAA ATC CAA ATG AAA 1440 
Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu lie Gin Met Lys 
10 465 470 475 480 

AGG ACA GCT ATT GAA GCA TTT AAT GAA ACC ATA AAA ATA TTT GAA GAA 1488 
Arg Thr Ala lie Glu Ala Phe Asn Glu Thr lie Lys lie Phe Glu Glu 
485 490 495 

15 

CAG TGC CAG ACC CAA GAG CGG TAC AGC AAA GAA TAC ATA GAA AAG TTT 1536 
Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr lie Glu Lys Phe 
500 505 510 

20 AAA CGT GAA GGC AAT GAG AAA GAA ATA CAA AGG ATT ATG CAT AAT TAT 1584 

Lys Arg Glu Gly Asn Glu Lys Glu lie Gin Arg lie Met His Asn Tyr 
515 520 525 

GAT AAG TTG AAG TCT CGA ATC AGT GAA ATT ATT GAC AGT AGA AGA AGA 1632 
25 Asp Lys Leu Lys Ser Arg lie Ser Glu lie lie Asp Ser Arg Arg Arg 
530 535 540 

TTG GAA GAA GAC TTG AAG AAG CAG GCA GCT GAG TAT CGA GAA ATT GAC 1680 
Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr Arg Glu lie Asp 
30 545 550 555 560 

AAA CGT ATG AAC AGC ATT AAA CCA GAC CTT ATC CAG CTG AGA AAG ACG 1728 

Lys Arg Met Asn Ser lie Lys Pro Asp Leu lie Gin Leu Arg Lys Thr 

565 570 575 

35 

AGA GAC CAA TAC TTG ATG TGG TTG ACT CAA AAA GGT GTT CGG CAA AAG 1776 

Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly Val Arg Gin Lys 
580 585 590 

40 AAG TTG AAC GAG TGG TTG GGC AAT GAA AAC ACT GAA GAC CAA TAT TCA 1824 
Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu Asp Gin Tyr Ser 
595 600 605 

CTG GTG GAA GAT GAT GAA GAT TTG CCC CAT CAT GAT GAG AAG ACA TGG 1872 
45 Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp Glu Lys Thr Trp 
610 615 620 

AAT GTT GGA AGC AGC AAC CGA AAC AAA GCT GAA AAC CTG TTG CGA GGG 1920 
Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn Leu Leu Arg Gly 
50 625 630 635 640 

AAG CGA GAT GGC ACT TTT CTT GTC CGG GAG AGC AGT AAA CAG GGC TGC 1968 
Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser Lys Gin Gly Cys 
645 650 655 



55 



TAT GCC TGC TCT GTA GTG GTG GAC GGC GAA GTA AAG CAT TGT GTC ATA 2016 
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Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys His Cys Val He 

665 



670 



AAC AAA ACA GCA ACT GGC TAT GGC ITT GCC GAG CCC TAT AAC TTG TAC 
Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro lyl IIu 



AGC TCT CTG AAA GAA CTG GTG CTA CAT TAC CAA CAC ACC TCC CTT GTG 
ser ser .eu Lys Glu Leu Val Leu His Tyr Gin His ^hr str I7u vTl 



695 



700 



Sn uf. f f ^ "^'^ ^™ TAC CCA GTA TAT GCA 

Gin Hxs Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr Pro vll ^r aS 

715 

CAG CAG AGG CGA CAG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC 
Gin Gin Arg Arg Gin Asp Pro Pro Val Ala Thr Met Val s« oTy 
'25 730 

^ s ?S - ^ - s S= S 



745 



750 



GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT 
ASP val Asn Gly His Lys Phe Ser Val Ser Gly Glu Sy Oil tty Z 

760 - 



765 



t^"" t^"" I*"" """^ *AG TTC ATC TGC ACC ACC GGC AAG 

Ala Thr TVr Gly Lys Leu Thr Leu Lys Phe He Cys Sy ^s 



780 



S= ?I? s s s S ?s ?s s ?s s 

795 

S S S S.^ ^. - - 21 

«05 810 815 

AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC 
Lys ser Ala Met Pro Glu Gly -ryr Val Gin Glu Arg Thr He pj^ 
820 825 830 

AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC 
Ly. ASP ASP Gly Asn Tyr Lys Thr Arg Ala Glu Val 21 txu Sy 



845 



"° 875 880 

AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2688 
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Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 
885 890 895 

TTC AAG ATC CGC CAC AAC ATC GAG GAG GGC AGC GTG GAG CTC GCC GAG 2 736 
5 Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
900 905 910 

CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC 2784 
His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
10 . 915 920 925 

GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC 2 832 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
930 935 940 

15 

GAG AAG CGC GAT CAC ATG GTG CTG CTG GAG TTC GTG ACC GCC GCC GGG 2 880 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
945 950 955 960 

20 ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TAA 2913 
He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
965 970 

25 (2) INFORMATION FOR SEQ ID N0:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 



Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr Asp Tyr Lys Lys 
15 10 15 

40 Glu Arg Glu Glu Asp He Asp Leu His Leu Gly Asp He Leu Thr Val 
20 25 30 

Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp Gly Gin Glu Ala 

35 40 45 

Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn Glu Thr Thr Gly 
45 50 55 60 

Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr He Gly Arg Lys 
65 70 75 80 

Lys He Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro 
85 90 95 

50 Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val Glu Gin Gin Ala 
100 105 110 

Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro Pro Asp He Ala 

115 120 125 

Pro Pro Leu Leu He Lys Leu Val Glu Ala He Glu Lys Lys Gly Leu 
55 130 135 140 

Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser Asn Leu Ala Glu 



147 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 

PCT/DK98/00145 



148 



145 



150 



Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val Asp Leu Glu HI 



165 



170 



lie ASP val His val Leu Ala Asp Ala Phe Lys Arg Tyr Leu ill Asp 



185 



Leu Pro Asn Pro Val lie Pro Ala Val Tyr Ser Glu Mel lie Ser 



195 



200 



Leu Ala Pro Glu Val Gin Ser Glu Glu Tyr He III Leu Leu Lys 

Lys Leu lie Arg Ser Pro Ser He Pro His Gin ^Jr Trp Leu Thr Leu 



Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin Thr Ser Ser lys 



245 



250 



Asn Leu Leu Asn Ala Arg Val Leu Ser oZ He Phe Ser Pro Leu 



260 



265 



Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu Asn Leu He Lys 

val He Glu He Leu He Ser Glu Trp Asn Glu Irg Gin Pro Ala 

300 

Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr Val Ala Asn Asn 

310 315 -a 

Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu Trp Tyr Trp Gly 

ASP He ser Arg g" Glu Val Asn Glu Lys Leu Arg Asp THr III Asp 

345 



Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met His Gly Asp Tyr 

V. 360 365 

Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu He Lys He Phe 

His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro III Thr Phe Ser Ser 

390 3 95 

Val Val Glu Leu He Asn His Tyr Arg Asn Glu Ser Leu Ala Gin 

410 415 
Asn Pro Lys Leu Asp Val Lys Leu Leu Tyr Pro Val Ser Lys Tyr Gin 

Gin ASP Gin val Val Lys Glu Asp Asn He Glu Ala Val Gly Lys Lys 

* 440 
Leu H.S Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser Arg Glu Tyr Asp 

455 460 
Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu He Gin Met Lys 

Arg Thr Ala He Glu Ala Phe Asn Glu Thr n'e Lys He Phe Glu III 

Gin cys Gin Thr Gin Glu Arg Tyr Ser lys Glu Tyr He Glu Jys Phe 

SOS 

Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He Met His Asn Tyr 

_ 520 

ASP Lys Leu Lys Ser Arg He Ser Glu He He Asp Ser Arg Arg Arg 



Leu Glu Glu ASP Leu Lys Lys Gin Ala Ala Glu 'l^l Arg Glu He Asp 



550 



555 



Lys Arg Met Asn Ser He Lys Pro Asp Leu He Gin Leu Arg Lys Thr 



570 



Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly Val Arg III Lys 
Lys Leu Asn Glu Trp Leu Gly Asn G^u Asn Thr Glu Asp "n Tyr Ser 
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595 600 605 

Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp Glu Lys Thr Trp 

610 615 620 

Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn Leu Leu Arg Gly 
5 625 630 635 640 

Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser Lys Gin Gly Cys 

645 650 655 

Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys His Cys Val lie 
660 665 670 

10 Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro Tyr Asn Leu Tyr 
675 660 6B5 

Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His Thr Ser Leu Val 

690 695 700 

Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr Pro Val Tyr Ala 
15 705 710 715 720 

Gin Gin Arg Arg Gin Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 

725 730 735 

Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly 
740 745 750 

20 Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
755 760 765 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 

770 775 780 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
25 785 790 795 800 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 

805 810 . 815 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 
820 825 830 

30 Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
835 840 845 

Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu 

850 855 860 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
35 865 870 875 880 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 

885 890 895 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
900 905 910 

40 His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
915 920 925 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 

930 935 940 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
45 945 950 955 960 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
965 970 



50 



(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
55 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . , . 1785 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Sit Ty 111 111 r A^^' -™ 

Met Gly Asn Ala Ala Ala Ala Lys Lys Gly Ser Glu Gin Glu Ser Val 

5 10 15 

AAA GAG TTC CTA GCC AAA GCC AAG GAA GAT ITC CTG AAA AAA TGG GAA 
Lys Glu Phe Leu Ala Lys Ala Lys Glu Asp Phe Leu 2^ Sp 



2S 



30 



GAC CCC TCT CAG AAT ACA GCC CAG TTG GAT CAG TTT GAT AGA ATC AAG 
ASP Pro ser Gin Asn Thr Ala Gin Leu Asp Gin Phe A^p ijs 



45 



55 60 



ill ser G?v ^^"^ T *™ °^<= ^= <^TG 

Glu ser Gly Asn Hxs Tyr Ala Met Lys He Leu Asp Lys Gin Lys Val 

" 80 
GTG AAG CTA AAG CAG ATC GAG CAC ACT CTG AAT GAG AAG CGC ATC CTG 
val Lys Leu Lys Gin He Glu His Thr Leu Asn Glu Lys Arg ill l^ 
85 90 35 

CAG GCC GTC AAC TTC CCG TTC CTG GTC AAA CTT GAA TTC TCC TTC AAC 
om Ala val Asn Phe Pro Phe Leu Val Lys Leu X 21 ^ 21 



105 



110 



Sp ^n J^n ^ ^^'^ GCT GGT GGC GAG 

Asp Asn ser Asn Leu Tyr Met Val Met Glu Tyr Val Ala Gly Gly Glu 



120 125 



21 ser f ° f ° CAT GCC 

Met Phe ser His Leu Arg Arg He Gly Arg Phe Ser Glu Pro His Ala 

"5 

111 ir '^'^ ^''^ ^'^^ ^^'^ C^ ACC TTT GAG TAT CTG CAC TCC 

Arg Phe Tyr Ala Ala Gin He Val Leu .^r Phe Glu Tyr ler 

"° "5 

CTG GAC CTC ATC TAC CGG GAC CTG AAG CCC GAG AAT CTT CTC ATC GAC 
X-eu ASP Leu He Tyr Arg Asp Leu Lys Pro Glu Asn Leu lIu He Sp 
165 ivo ;^75 



185 



190 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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AAA GGC CGT ACT TOG ACC TTG TGT GGG ACC CCT GAG TAG TTG GCC CCC 624 

Lys Gly Arg Thr Trp Thr Leu Cys Gly Thr Pro Glu Tyr Leu Ala Pro 

195 200 205 

5 

GAG ATT ATC CTG AGO AAA GGC TAC AAC AAG GCT GTG GAC TGG TGG GCT 672 

Glu lie lie Leu Ser Lys Gly Tyr Asn Lys Ala Val Asp Trp Trp Ala 

210 215 220 

10 CTC GGA GTC CTC ATC TAC GAG ATG GCT GCT GGT TAC CCA CCC TTC TTC 720 
Leu Gly Val Leu lie Tyr Glu Met Ala Ala Gly Tyr Pro Pro Phe Phe 
225 230 235 240 

GCT GAC CAG CCT ATC CAG ATC TAT GAG AAA ATC GTC TCT GGG AAG GTG 768 
15 Ala Asp Gin Pro lie Gin lie Tyr Glu Lys lie Val Ser Gly Lys Val 

245 250 255 

CGG TTC CCA TCC CAC TTC AGC TCT GAC TTG AAG GAC CTG CTG CGG AAC 816 
Arg Phe Pro Ser His Phe Ser Ser Asp Leu Lys Asp Leu Leu Arg Asn 
20 260 265 270 

CTT CTG CAA GTG GAT CTA ACC AAG CGC TTT GGA AAC CTC AAG GAC GGG 864 
Leu Leu Gin Val Asp Leu Thr Lys Arg Phe Gly Asn Leu Lys Asp Gly 
275 280 285 

25 

GTC AAT GAC ATC AAG AAC CAC AAG TGG TTT GCC ACG ACT GAC TGG ATT 912 
Val Asn Asp lie Lys Asn His Lys Trp Phe Ala Thr Thr Asp Trp lie 
290 295 300 

30 GCC ATC TAT CAG AGA AAG GTG GAA GCT CCC TTC ATA CCA AAG TTT AAA 960 

Ala lie Tyr Gin Arg Lys Val Glu Ala Pro Phe lie Pro Lys Phe Lys 
305 310 315 320 

GGC CCT GGG GAC ACG AGT AAC TTT GAC GAC TAT GAG GAG GAA GAG ATC 1008 
35 Gly Pro Gly Asp Thr Ser Asn Phe Asp Asp Tyr Glu Glu Glu Glu lie 

325 330 335 

CGG GTC TCC ATC AAT GAG AAG TGT GGC AAG GAG TTT ACT GAG TTT GGG 1056 
Arg Val Ser lie Asn Glu Lys Cys Gly Lys Glu Phe Thr Glu Phe Gly 
40 340 345 350 

CGC GCC ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT 1104 
Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie 
355 360 365 

45 

CTT GTT GAA TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT 1152 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser 
370 375 380 

50 GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT 12 00 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
385 390 395 400 

ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC ACT 1248 
55 lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

405 410 415 
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162 



s 2^ ?s z i- ^ s z x ;™ 

430 

s° sj: 2^ s s - - - 

435 Ser Ala Met Pro Glu Gly Tyr Val Gin 



440 



c ^ s Sp s r "° ^ °- 

450 ^ ^ ° "^^^ Ala 



4" 460 



s 1?.^ s s SI ;s s lis 



475 



460 



GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT rr-A r^nr. 

«y ».p c.„ „p .^„" s z z if, ^. 



490 



495 



AAT GGC ATC AAA GTT AAr TTn aan yyrr^ ^ 

Asn Gly lie Uys III JJn 7^1 JJI J^n T T 

525 -^-^^ '^^S His Asn lie Lys Asp Gly 



520 



vii s s 2? ^ ^ Si Si :s s s 



540 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



SSS 560 

c s ^1 tTn z r r »,=s 

^ ^""^ Leu Leu Glu 



1776 



575 

SI 7.1 ?s aS a" ?r r 

Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
CCT CAG GAG TAA 

Pro Gin Glu 1788 
595 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 595 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

152 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



Met 


Gly 


Asn 


Ala 


Ala 


Ala 


Ala 


Lys 


Lys 


Gly Ser 


Glu 


Gin 


Glu 


Ser 


Val 


1 








5 










10 










15 




Lys 


Glu 


Phe 


Leu 


Ala 


Lys 


Ala 


Lys 


Glu 


Asp 


Phe 


Leu 


Lys 


Lys 


Trp 


Glu 








20 










25 










30 






Asp 


Pro 


Ser 


Gin 


Asn 


Thr 


Ala 


Gin 


Leu 


Asp 


Gin 


Phe 


Asp 


Arg 


He 


Lys 






35 










40 










45 








Thr 


Leu 


Gly 


Thr 


Gly 


Ser 


Phe 


Gly 


Arg 


Val 


Met 


Leu 


Val 


Lys 


His 


Lys 




50 










55 










60 










Glu 


Ser 


Gly 


Asn 


His 


Tyr 


Ala 


Met 


Lys 


He 


Leu 


Asp 


Lys 


Gin 


Lys 


Val 


65 










70 










75 










80 


Val 


Lys 


Leu 


Lys 


Gin 


He 


Glu 


His 


Thr 


Leu 


Asn 


Glu 


Lys 


Arg 


He 


Leu 










85 










90 










95 




Gin 


Ala 


Val 


Asn 


Phe 


Pro 


Phe 


Leu 


Val 


Lys 


Leu 


Glu 


Phe 


Ser 


Phe 


Lys 








100 










105 










110 






Asp 


Asn 


Ser 


Asn 


Leu 


Tyr 


Met 


Val 


Met 


Glu 


Tyr 


Val 


Ala 


Gly 


Gly 


Glu 






115 










120 










125 








Met 


Phe 


Ser 


His 


Leu 


Arg 


Arg 


He 


Gly 


Arg 


Phe 


Ser 


Glu 


Pro 


His 


Ala 




130 










135 










140 










Arg 


Phe 


Tyr 


Ala 


Ala 


Gin 


He 


Val 


Leu 


Thr 


Phe 


Glu 


Tyr 


Leu 


His 


Ser 


145 










150 










155 










160 


Leu 


Asp 


Leu 


He 


Tyr 


Arg 


Asp 


Leu 


Lys 


Pro 


Glu 


Asn 


Leu 


Leu 


He 


Asp 










165 










170 










175 




Gin 


Gin 


Gly 


Tyr 


He 


Gin 


Val 


Thr 


Asp 


Phe 


Gly 


Phe 


Ala 


Lys 


Arg 


Val 








180 










185 










190 






Lys 


Gly 


Arg 


Thr 


Trp 


Thr 


Leu 


Cys 


Gly 


Thr 


Pro 


Glu 


Tyr 


Leu 


Ala 


Pro 






195 










200 










205 








Glu 


He 


He 


Leu 


Ser 


Lys 


Gly 


Tyr 


Asn 


Lys 


Ala 


Val 


Asp 


Trp 


Trp 


Ala 




210 










215 










220 










Leu 


Gly 


Val 


Leu 


He 


Tyr 


Glu 


Met 


Ala 


Ala 


Gly 


Tyr 


Pro 


Pro 


Phe 


Phe 


225 










230 










235 










240 


Ala 


Asp 


Gin 


Pro 


He 


Gin 


He 


Tyr 


Glu 


Lys 


He 


Val 


Ser Gly 


Lys 


Val 










245 










250 










255 




Arg 


Phe 


Pro 


Ser 


His 


Phe 


Ser 


Ser 


Asp 


Leu 


Lys 


Asp 


Leu 


Leu 


Arg 


Asn 








260 










265 










270 






Leu 


Leu 


Gin 


Val 


Asp 


Leu 


Thr 


Lys 


Arg 


Phe Gly 


Asn 


Leu 


Lys 


Asp 


Gly 






275 










280 










285 








Val 


Asn 


Asp 


He 


Lys 


Asn 


His 


Lys 


Trp 


Phe 


Ala 


Thr 


Thr 


Asp 


Trp 


He 




290 










295 










300 










Ala 


He 


Tyr 


Gin 


Arg 


Lys 


Val 


Glu 


Ala 


Pro 


Phe 


He 


Pro 


Lys 


Phe 


Lys 


305 










310 










315 










320 


Gly 


Pro 


Gly 


Asp 


Thr 


Ser 


Asn 


Phe 


Asp 


Asp 


Tyr 


Glu 


Glu 


Glu 


Glu 


He 










325 










330 










335 




Arg 


Val 


Ser 


He 


Asn 


Glu 


Lys 


Cys 


Gly 


Lys 


Glu 


Phe 


Thr 


Glu 


Phe 


Gly 








340 










345 










350 






Arg 


Ala 


Met 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly 


Val 


Val 


Pro 


He 






355 










360 










365 








Leu 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly Gin 


Lys 


Phe 


Ser 


Val 


Ser 




370 










375 










380 










Gly 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly Lys 


Leu 


Thr 


Leu 


Lys 


Phe 
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^f^ 395 

Xle cys Thr xhr cly .ys .eu P« Val Pro Trp Pro Thr .eu val Z 

410 

Thr .eu r.r Oly Val 0^n Cys P.e Ser .rg ryr Pro .sp JL' Met 

.ys Gin His ASP Phe Phe .ys Ser 1" Met Pro Olu 01, val GXn 

Clu jrg Thr Xle Pl.e Tyr .y. ..p 

Olu val Lys Phe oiu ciy .sp xhr Leu Val Asn 1^^ xie Olu .eu .ys 



Gly lie Asp Phe Lys Glu Asn a=„ ti t^^ , 

ys ^xu ASP Gly Asn He Leu Gly His Lys Met Glu 

490 

Tyr Asn Tyr Asn Ser His Asn Val Tyr Xle Met Ala Asp Lys til Lys 
Asn Cly Xle Lys Val Asn Phe Lys III Arg His Asn xle Lys Asp Gly 
ser val Gin Leu Ala Asp His Gin Gin Asn Thr p" xle Gly Asp 

Gly Pro val Leu Leu Pro Asp Asn His Tyr Leu Itl Thr Gin Ser Ala 



550 



555 

Leu ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Xle Leu Leu Gl" 

^oi> 570 

Phe val TKr Ala Ala Gly xle Thr His Gly Met Asp Glu Leu JJr Lys 
Pro Gin Glu 590 
595 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2178 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70- 

»7 S Z S SI S - s ^. s s 



48 

1 Gly 

15 

96 



25 



30 



™^ ^ s s r 

y lyr i.ys Glu Arg Pro Gin Asp Val Asp Gin Arg 



144 



45 
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GAG GCT CCC CTC AAC AAC TTC TCT GTG GCG CAG TGC CAG CTG ATG AAG 192 

Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gin Cys Gin Leu Met Lys 
50 55 60 

5 

ACG GAG CGG CCC CGG CCC AAC ACC TTC ATC ATC CGC TGC CTG CAG TGG 240 

Thr Glu Arg Pro Arg Pro Asn Thr Phe lie lie Arg Cys Leu Gin Trp 
65 70 75 80 

10 ACC ACT GTC ATC GAA CGC ACC TTC CAT GTG GAG ACT CCT GAG GAG CGG 2 88 

Thr Thr Val lie Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 
85 90 95 

GAG GAG TGG ACA ACC GCC ATC CAG ACT GTG GCT GAC GGC CTC AAG AAG 336 
15 Glu Glu Trp Thr Thr Ala lie Gin Thr Val Ala Asp Gly Leu Lys Lys 
100 105 110 

CAG GAG GAG GAG GAG ATG GAC TTC CGG TCG GGC TCA CCC AGT GAC AAC 384 
Gin Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
20 115 120 125 

TCA GGG GCT GAA GAG ATG GAG GTG TCC CTG GCC AAG CCC AAG CAC CGC 432 

Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 
130 135 140 

25 

GTG ACC ATG AAC GAG TTT GAG TAC CTG AAG CTG CTG GGC AAG GGC ACT 480 

Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145 150 155 160 

30 TTC GGC AAG GTG ATC CTG GTG AAG GAG AAG GCC ACA GGC CGC TAC TAC 52 8 

Phe Gly Lys Val lie Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 
165 170 175 

GCC ATG AAG ATC CTC AAG AAG GAA GTC ATC GTG GCC AAG GAC GAG GTG 576 
35 Ala Met Lys lie Leu Lys Lys Glu Val lie Val Ala Lys Asp Glu Val 
180 185 190 

GCC CAC ACA CTC ACC GAG AAC CGC GTC CTG CAG AAC TCC AGG CAC CCC 624 
Ala His Thr Leu Thr Glu Asn Arg Val Leu Gin Asn Ser Arg His Pro 
40 195 200 205 

TTC CTC ACA GCC CTG AAG TAC TCT TTC CAG ACC CAC GAC CGC CTC TGC 672 

Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gin Thr His Asp Arg Leu Cys 
210 215 220 

45 

TTT GTC ATG GAG TAC GCC AAC GGG GGC GAG CTG TTC TTC CAC CTG TCC 720 

Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225 230 235 240 

50 CGG GAA CGT GTG TTC TCC GAG GAC CGG GCC CGC TTC TAT GGC GCT GAG 768 

Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly Ala Glu 
245 250 255 

ATT GTG TCA GCC CTG GAC TAC CTG CAC TCG GAG AAG AAC GTG GTG TAC 816 
55 lie Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 
260 265 270 
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Z S X z s s z z s z z 

285 

^ - z ^ z s 

300 

^ S „T 

315 

2u G^u Asn f '^'^ '^"^ ^-^^ ^'^^ ™° «<3<^ CTG GGC 

Leu Glu ASP Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 

32S 330 335 

^ S SI? S ^ 

350 

365 

AAG AAG GAC CCC AAG CAG AGG CTT GGC GGG GGC TCC GAG GAC GCC AAr 
Lys Kys ASP Pro Lys Gin A.g .eu Gly Gly Gly se'r G^^ 



395 



400 



III lit Zt Sn r Ih" T ^'^^ ^° «^ '^TG 

Met Gin Has Arg Phe Phe Ala Gly He Val Trp Gin His Val 

410 

Sr Su ws SJs f ^ ^-^^ '^CG TCG GAG 

Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gin Val Thr Ser Glu 

425 430 

S ^^"^ f ° '''''' ^'^'^ "^TC °CC CAG ATG ATC ACC 

Thr ASP Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gin mH 111 Jhr 

440 445 



ill Jh^ p"^ ^_ ^-^O GAG T..T GT^ GAC AGC 



Ti^ TH-*- r>v.« r» ^ '^'^^ GTG GAC AG 

lie Thr Pro Pro Asp Gin Asp Asp Ser Met Glu Cys Val Asp Se 

*55 4gQ 

z s z z z.zzi;^.--^. I- s J. 

"•'^ 475 480 



490 



495 



8 64 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



GAG 1392 
r Glu 



1440 



1488 



156 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 PCT/DK98/00145 

157 

ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAG GGC GAG GTA AAC GGC 1536 

Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

500 505 510 

5 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAG GGC 1584 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 

515 520 525 

10 AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 1632 
Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro 
530 535 540 

TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC 1680 
15 Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
545 550 555 560 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 1728 
Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
20 565 570 575 

CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 1776 

Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 

580 585 590 

25 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 1824 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
595 600 605 

30 AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 1872 
Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie 

610 615 620 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC 1920 
35 Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie 
625 630 635 * 640 

ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC 1968 
Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg 
40 645 650 655 

CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG 2016 
His Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 
660 665 670 

45 

AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 2064 
Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
675 680 685 

50 CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT 2112 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
690 695 700 

CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC 2160 
55 His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly 
705 710 715 720 
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20 
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158 



ATG GAC GAG CTG TAG AAG TAA 

Met Asp Glu Leu Tyr Lys 2181 



55 



725 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 72 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Ser Asp Val Ala He Val Lys Glu Gly Trp Leu His Lys Arg Gly 

10 

Olu ryr lie ^ys Thr Trp Arg Pro Arg ^r Phe .eu Leu Lys IL Asp 
Oly Thr Phe lie Gly Tyr Lys Glu Irg Pro Gin Asp Val Isp Gin Arg 

Glu Ala Pro Leu Asn Asn Phe Se^ Val Ala Gin Cys gL Leu Met Lys 

60 



Thr Glu Arg Pro Arg Pro Asn Thr Phe He He Arg Cy. Leu Gin Trp 

30 Thr Thr val He Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Irg 

^5 90 



Glu Glu Trp Thr Thr Ala Xle Gin Thr Val Ala Asp Gly Leu Lys Lys 

Gin Glu Glu Glu Glu Met A.p Phe Irg Ser Gly Ser Pro Ser Asp Asn 

ser Gly Ala Glu clu Met Glu Z Ser Leu Ala Lys III Lys His Arg 

val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Itu Gly Lys Gly Thr 

40 Phe Gly Lys Val He Leu Val Lys Glu Lys III Thr Gly Arg Tyr 

170 

Ala Met Lys lie Leu Lys Lys Glu Val He Val Ala Lys Asp g!u Val 
45 ""'^ "'^ If, -^^^ val Leu Gin Asn Ser Irg His Pro 

Phe Leu Thr Ala Leu Lys Tyr s^r Phe Gin Thr His Arg Leu Cys 

Phe val Met Clu Tyr Ala Asn Gly Gly Glu Leu III Phe His Leu Ser 

50 Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Ar^ Phe Tyr Gly Ala 111 

250 



lie val ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val III Tyr 

Arg ASP Leu Lys Leu Glu Asn Leu ^et Leu Asp Lys Asp Gl'y His He 

280 



I.ys He Thr ASP Phe Gly Leu C^s Lys Glu Gly He !ys Asp Gly Ala 
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290 295 300 

Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305 310 315 320 

Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 
5 325 330 335 

Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gin 

340 345 350 

Asp His Glu Lys Leu Phe Glu Leu He Leu Met Glu Glu He Arg Phe 
355 360 365 

10 Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 
370 375 380 

Lys Lys Asp Pro Lys Gin Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385 390 395 400 

Glu lie Met Gin His Arg Phe Phe Ala Gly He Val Trp Gin His Val 
15 405 410 415 

Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gin Val Thr Ser Glu 

420 425 430 

Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gin Met He Thr 
435 440 445 

20 He Thr Pro Pro Asp Gin Asp Asp Ser Met Glu Cys Val Asp Ser Glu 
450 455 460 

Arg Arg Pro His Phe Pro Gin Phe Ser Tyr Ser Ala Ser Ser Thr Ala 
465 470 475 480 

Ser Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
25 485 490 495 

Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

500 505 510 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
515 520 525 

30 Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 
530 535 540 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
545 550 555 560 

Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
35 565 570 575 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 

580 585 590 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
595 600 605 

40 Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 
610 615 620 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
625 630 635 640 

Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
45 645 650 655 

His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

660 665 670 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
675 680 685 

50 Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
690 695 700 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 
705 710 715 720 

Met Asp Glu Leu Tyr Lys 
55 725 
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(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .274 8 
(D) OTHER INFORMATION: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO- 72- 

5 10 15 

S S ^ s ?n S S S ^. i S ?S 



40 45 



=5 60 



S =JJ -J - ™ - - - ^ ra. c„ 

65 ^ ^^-^ Lys Arg Cys His Glu 

75 

s:^ IS ?s s r r -^-^^ '^'^^ 

^er cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp 



90 95 



z ^ s z s - s - s s s 



105 



110 



s ^ 51 z SI s s s SI ^r. SI s; 



"5 



ATC AAT GAC CCT AGC CTC Tnn r-r-^ nmr, 

lie Asn ASP Pro Ser L'u Cys' S Jit Z Ss of r 

145 ^ ™^ Lys Arg Gly 

160 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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CGG ATT TAT CTG AAG GCT GAG GTC ACT GAT GAA AAG CTC CAC GTC ACG 528 

Arg lie Tyr Leu Lys Ala Glu Val Thr Asp Glu Lys Leu His Val Thr 
165 170 175 

5 

GTA CGA GAT GCA AAA AAT CTA ATC CCT ATG GAT CCA AAT GGG CTT TCG 576 

Val Arg Asp Ala Lys Asn Leu lie Pro Met Asp Pro Asn Gly Leu Ser 
180 185 190 

10 GAT CCT TAT GTG AAG CTG AAA CTA ATC CCT GAC CCC AAG AAT GAG AGC 624 

Asp Pro Tyr Val Lys Leu Lys Leu lie Pro Asp Pro Lys Asn Glu Ser 
195 200 205 

AAA CAG AAA ACC AAA ACC ATC CGC TCC TUVC CTG AAT CCT CAG TGG AAT 672 
15 Lys Gin Lys Thr Lys Thr lie Arg Ser Asn Leu Asn Pro Gin Trp Asn 
210 215 220 

GAG TCC TTC ACG TTC AAA TTA AAA CCT TCA GAC AAA GAC CGG CGA CTG 720 
Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
20 225 230 235 240 

TCT GTA GAA ATC TGG GAC TGG GAT CGG ACG ACT CGG AAT GAC TTC ATG 768 

Ser Val Glu lie Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 
245 250 255 

25 

GGA TCC CTT TCC TTT GGT GTC TCA GAG CTA ATG AAG ATG CCG GCC AGT 816 

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 
260 265 270 

30 GGA TGG TAT AAA GCT CAC AAC CAA GAA GAG GGC GAA TAT TAC AAC GTG 864 

Gly Trp Tyr Lys Ala His Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 

CCC ATT CCA GAA GGA GAT GAA GAA GGC AAC ATG GAA CTC AGG CAG AAG 912 
35 Pro lie Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 

TTT GAG AAA GCC AAG CTA GGT CCT GTT GGT AAC AAA GTC ATC AGC CCT 960 
Phe Glu Lys Ala Lys Leu Gly Pro Val Gly Asn Lys Val lie Ser Pro 
40 305 310 315 320 

TCA GAA GAC AGA AAG CAA CCA TCC AAC AAC CTG GAC AGA GTG AAA CTC 1008 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 

325 330 335 

45 

ACA GAC TTC AAC TTC CTC ATG GTG CTG GGG AAG GGG AGT TTT GGG AAG 1056 

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 
340 345 350 

50 GTG ATG CTT GCT GAC AGG AAG GGA ACG GAG GAA CTG TAC GCC ATC AAG 1104 
Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala lie Lys 
355 360 365 

ATC CTG AAG AAG GAC GTG GTG ATC CAG GAC GAC GAC GTG GAG TGC ACC 1152 
55 lie Leu Lys Lys Asp Val Val lie Gin Asp Asp Asp Val Glu Cys Thr 
370 375 380 
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s s s s 2J 2^ ™ s - 



395 



400 



ACA CAG CTG CAC TCC TGC TTC CAG ACA GTG GAC CGG CTG TAG TTC GTC 
Thr Gin Leu His Ser Cys Phe Gin Thr uaI a=„ « , 
405 ^ 



410 



ATG GAA TAG GTC AAC GGC GGG GAT CTT ATG TAG CAC ATT PAf- n»» r^r. 
Met Glu ryr Val Asn Gly Gly Asp Leu Me^ His tJI Sn G^J 



425 



430 



s 5^ s s s s 



440 



460 



iTu Sit r ^ <^-c 

465 J,^ "'^'^ He Lys He Ala 



^"^5 480 



GAC TTC GGG ATG TGC AAG GAA CAC AT<5 ATG GAT GGA GTC ACG ACC AGG 
ASP Phe Gly Met Cys Lys Glu His Met Met Asp Gly vTl Jhr Jhr Tc, 



*85 490 495 



505 



510 



CAG CCG TAC GGG AAG TCT GTA GAT TGG TGG GCG TAC G6T GTG CTG CTG 
Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr G^y vTx lIu Su 



520 



525 



J- S - s S Z Z o" 



535 540 



SI s s =j» - - s s ^ - 

555 560 

IIu sir J^s G?; SI ITr ^ 

i^ys Glu Ala val ser He Cys Lys Gly Leu Met Thr Lys Gin 
565 570 5,5 

Pvl 111 ^ 7° ^"'^ ^'^^ GAG AGG GAT GTC AGA 

Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp vll Arg 



585 



590 



605 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 
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GAG ATC CAA CCA 
Glu lie Gin Pro 
610 

5 

AAC TTT GAC AAG 
Asn Phe Asp Lys 
625 

10 GAT CAG CTG GTC 
Asp Gin Leu Val 



TOG TAT GTC AAC 
15 Ser Tyr Val Asn 
660 

GGG CGC GCC ATG 
Gly Arg Ala Met 
20 675 

ATT CTT GTT GAA 
He Leu Val Glu 
690 

25 

AGT GGA GAG GGT 
Ser Gly Glu Gly 
705 

30 TTT ATT TGC ACT 

Phe He Cys Thr 



ACT ACT CTC ACT 
35 Thr Thr Leu Thr 
740 

ATG AAA CAG CAT 
Met Lys Gin His 
40 755 

CAG GAA AGA ACT 
Gin Glu Arg Thr 
770 

45 

GCT GAA GTC AAG 
Ala Glu Val Lys 
785 

50 AAA GGT ATT GAT 
Lys Gly He Asp 



GAA TAC AAT TAT 
55 Glu Tyr Asn Tyr 
820 



CCA TTC AAG CCC AAA 
Pro 'Phe Lys Pro Lys 
615 

TTC TTC ACG CGA GGA 
Phe Phe Thr Arg Gly 
630 

ATT GCT AAC ATA GAC 
He Ala Asn He Asp 
645 

CCC CAG TTT GTG CAC 
Pro Gin Phe Val His 
665 

AGT AAA GGA GAA GAA 
Ser Lys Gly Glu Glu 
680 

TTA GAT GCC GAT GTT 
Leu Asp Gly Asp Val 
695 

GAA GGT GAT GCA ACA 
Glu Gly Asp Ala Thr 
710 

ACT GGG AAG CTA CCT 
Thr Gly Lys Leu Pro 
725 

TAT GGT GTT CAA TGC 
Tyr Gly Val Gin Cys 
745 

GAC TTT TTC AAG AGT 
Asp Phe Phe Lys Ser 
760 

ATA TTT TAC AAA GAT 
He Phe Tyr Lys Asp 
775 

TTT GAA GGT GAT ACC 
Phe Glu Gly Asp Thr 
790 

TTT AAA GAA GAT GGA 
Phe Lys Glu Asp Gly 
805 

AAC TCA CAT AAT GTA 
Asn Ser His Asn Val 
825 



GTG TGT GGC AAA GGA 
Val Cys Gly Lys Gly 
620 

CAG CCT GTC TTA ACA 
Gin Pro Val Leu Thr 
635 

CAA TCT GAT TTT GAA 
Gin Ser Asp Phe Glu 
650 

CCA ATC TTG CAA AGT 
Pro He Leu Gin Ser 
670 

CTT TTC ACT GGA GTT 
Leu Phe Thr Gly Val 
685 

AAT GGG CAA AAA TTC 
Asn Gly Gin Lys Phe 
700 

TAC GGA AAA CTT ACC 
Tyr Gly Lys Leu Thr 
715 

GTT CCA TGG CCA ACG 
Val Pro Trp Pro Thr 
730 

TTT TCT AGA TAC CCA 
Phe Ser Arg Tyr Pro 
750 

GCC ATG CCC GAA GGT 
Ala Met Pro Glu Gly 
765 

GAC GGG AAC TAC AAG 
Asp Gly Asn Tyr Lys 
780 

CTT GTT AAT AGA ATC 
Leu Val Asn Arg He 
795 

AAC ATT CTT GGA CAC 
Asn He Leu Gly His 
810 

TAC ATC ATG GCA GAC 
Tyr He Met Ala Asp 
830 



GCA GAA 1872 
Ala Glu 



CCA CCA 1920 
Pro Pro 
640 

GGG TTC 1968 

Gly Phe 
655 

GCA GTA 2016 
Ala Val 



GTC CCA 2064 
Val Pro 



TCT GTT 2112 
Ser Val 



CTT AAA 2160 
Leu Lys 
720 

CTT GTC 2208 

Leu Val 

735 

GAT CAT 2256 
Asp His 



TAT GTA 2304 
Tyr Val 



ACA CGT 2352 
Thr Arg 



GAG TTA 2400 

Glu Leu 
800 

AAA ATG 244 8 

Lys Met 

815 

AAA CCA 2496 
Lys Pro 
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845 



2544 



S Ser ^ IIu 1?" f ^ ""^^ ^ ^^-^ ATT GGC 

ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly 

S S 2 r S S ?S 

875 QgQ 

6CC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT 
Ala Leu ser Lys Asp Pro Asn Glu Lys Arg Asp His Met ll. Zl 

890 895 

G?u SI ?it ?s aS it. s r r °- 

905 

AAA CCT CAG GAG TAA 

Lys Pro Gin Glu 2751 
915 



2592 



2640 



2688 



2736 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 916 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met Ala ASP val Tyr Pro Ala Asn Asp Ser Thr Ala Ser Gin Asp Val 

Ala Asn Arg Phe Ala Arg Lys Gly Ala IL Arg Gin Lys Asn vll His 

Glu val Lys Asp His Lys Phe lie IL Arg Phe Phe Lys gL Pro Thr 

Phe cys ser His Cys Thr Asp IL He Trp Gly Phe Gly Lys Gin Gly 

Phe Gin cys Gin Val Cys Cys Phe Val Val His ^^s Arg Cys His Glu 

Phe val Thr Phe Ser Cys Pro Gly Ala Asp ^ys Gly Pro Asp Thr Isp 

ASP Pro Arg Ser Lys His Lys Phe Lys He His Thr Tyr Gly Ser Pro 

Thr Phe cys Asp His Cys Gly ser IZ Leu oyr Gly Leu III His Gin 

_ 120 125 

Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Asn Gin Cys Val 
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130 135 140 

lie Asn Asp Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly 
145 150 155 160 

Arg lie Tyr Leu Lys Ala Glu Val Thr Asp Glu Lys Leu His Val Thr 
5 165 170 175 

Val Arg Asp Ala Lys Asn Leu lie Pro Met Asp Pro Asn Gly Leu Ser 

180 185 190 

Asp Pro Tyr Val Lys Leu Lys Leu lie Pro Asp Pro Lys Asn Glu Ser 
195 200 205 

10 Lys Gin Lys Thr Lys Thr lie Arg Ser Asn Leu Asn Pro Gin Trp Asn 
210 215 220 

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

Ser Val Glu lie Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 
15 245 250 255 

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 

260 265 270 

Gly Trp Tyr Lys Ala His Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 

20 Pro lie Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 

Phe Glu Lys Ala Lys Leu Gly Pro Val Gly Asn Lys Val lie Ser Pro 
305 310 315 320 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 
25 325 330 335 

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 

340 345 350 

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala lie Lys 
355 360 365 

30 lie Leu Lys Lys Asp Val Val lie Gin Asp Asp Asp Val Glu Cys Thr 
370 375 380 

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
385 390 395 400 

Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 
35 405 410 415 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr . His He Gin Gin Val 

420 425 430 

Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 
435 440 445 

40 He Gly Leu Phe Phe Leu His Lys Arg Gly He He Tyr Arg Asp Leu 
450 455 460 

Lys Leu Asn Asn Val Met Leu Asn Ser Glu Gly His He Lys He Ala 
465 470 475 480 

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg 
45 485 490 495 

Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He He Ala Tyr 

500 505 510 

Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu 
515 520 525 

50 Tyr Glu Met Leu Ala Gly Gin Pro Pro Phe Asp Gly Glu Asp Glu Asp 
530 535 540 

Glu Leu Phe Gin Ser He Met Glu His Asn Val Ser Tyr Pro Lys Ser 
545 550 555 560 

Leu Ser Lys Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys Gin 
55 565 570 575 

Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg 
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585 590 
Glu His Ala Phe Phe Arg Arg He Asp Trp Glu Lys Leu Glu Asn Arg 

600 605 
Glu He Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 

610 615 620 

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 

630 635 640 

Asp Gin Leu Val He Ala Asn He Asp Gin Ser Asp Phe Glu Gly Phe 

650 655 
Ser Tyr Val Asn Pro Gin Phe Val His Pro He Leu Gin Ser Ala Val 

665 670 
Gly Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 

^"75 680 685 

He Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val 

695 700 
Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
''t^ 715 720 

Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

725 730 
Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

■^^0 745 750 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 

■^55 760 765 

Gin Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg 
25 "770 775 780 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
''^^ 790 795 800 

Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met 

on 815 
30 Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro 
520 825 830 

Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Lys Asp 

835 640 845 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 
^5 «50 855 860 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
ff5 870 875 880 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu 

895 

40 Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr 
900 905 910 

Lys Pro Gin Glu 
915 

^5 (2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



50 



55 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
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(B) LOCATION: 1. . .2154 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

5 

ATG TCG TCC ATC TTG CCA TTC ACG CCG CCA GTT GTG AAG AGA CTG CTG 48 
Met Ser Ser lie Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
15 10 15 

10 GGA TGG AAG AAG TCA GCT GGT GGG TCT GGA GGA GCA GGC GGA GGA GAG 96 
Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
20 25 30 

CAG AAT GGG CAG GAA GAA AAG TGG TGT GAG AAA GCA GTG AAA AGT CTG 144 
15 Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
35 40 45 

GTG AAG AAG CTA AAG AAA ACA GGA CGA TTA GAT GAG CTT GAG AAA GCC 192 
Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
20 50 55 60 

ATC ACC ACT CAA AAC TGT AAT ACT AAA TGT GTT ACC ATA CCA AGC ACT 240 
lie Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr lie Pro Ser Thr 
65 70 75 80 

25 

TGC TCT GAA ATT TGG GGA CTG AGT ACA CCA AAT ACG ATA GAT CAG TGG 288 
Cys Ser Glu lie Trp Gly Leu Ser Thr Pro Asn Thr lie Asp Gin Trp 
65 90 95 

30 GAT ACA ACA GGC CTT TAG AGC TTC TCT GAA CAA ACC AGG TCT CTT GAT 336 
Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp 
100 105 110 

GGT CGT CTC CAG GTA TCC CAT CGA AAA GGA TTG CCA CAT GTT ATA TAT 3 84 

35 Gly Arg Leu Gin Val Ser His Arg Lys Gly Leu Pro His Val lie Tyr 
115 120 125 

TGC CGA TTA TGG CGC TGG CCT GAT CTT CAC AGT CAT CAT GAA CTC AAG 432 
Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 
40 130 135 140 

GCA ATT GAA AAC TGC GAA TAT GCT TTT AAT CTT AAA AAG GAT GAA GTA 480 

Ala lie Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
145 150 155 160 

45 

TGT GTA AAC CCT TAC CAC TAT CAG AGA GTT GAG ACA CCA GTT TTG CCT 528 

Cys Val Asn Pro Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro 
165 170 175 

50 CCA GTA TTA GTG CCC CGA CAC ACC GAG ATC CTA ACA GAA CTT CCG CCT 576 
Pro Val Leu Val Pro Arg His Thr Glu lie Leu Thr Glu Leu Pro Pro 
180 185 190 

CTG GAT GAC TAT ACT CAC TCC ATT CCA GAA AAC ACT AAC TTC CCA GCA 624 
55 Leu Asp Asp Tyr Thr His Ser lie Pro Glu Asn Thr Asn Phe Pro Ala 
195 200 205 
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GGA ATT GAG CCA CAG ACT AAT TAT ATT era naa ^r^r. 
Oly Xle C.U P.O ..n ser .sn ill IT. Tu ?S Pr^ ttl t^o 



215 220 



f ^ ^^'^ CAG TTG AAT CAA AGT 

Tyr lie Ser Glu Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Ser 



235 



240 



ie"? Sp ?s °s r 

P iir Gly ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro 



250 255 



™ ™ s= SI s S S 

270 

S cyl ITr il" T "'^^ «AA 
Trp cys ser He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu 

280 285 

s SI zi I- ir. ^ I- ?s z 

300 

Pro ITr s2 G?u Arf "™ ™ °" 

305 " '^''^ ^= Leu Ser Asn Val Asn 

315 

.-^i s s - - J- - s I- ^ i 

330 

S Zu Sr ?Ie Glv T ^ °" "'^^ ^"^^ 

Tyr Tyr He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 



345 



350 



Sa He IZ vlt Sn T f " 

lie Phe Val om Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp 



365 



~- va^ cys Lys He Pro Pro Gly Cys Asn Leu Lys He 

•''S 380 

Zl Sn IZ 111 SI r "^"^ 

385 ^^'^ Ala Gin ser val Asn Gin 



395 



400 



"y ^ Sa ?Il' Sr X r T." 

40S ^""^ "^"^ ^« T**^ Arg Met 



410 



415 



sii s s= s ^ r r 

420 ^^"^ ^""^ ^""S Thr Val Thr 



425 



430 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1248 



1296 
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AGT ACT CCT TGC TGG ATT GAA CTT CAT CTG AAT GGA CCT CTA CAG TGG 1344 
Ser Thr Pro Cys Trp lie Glu Leu His Leu Asn Gly Pro Leu Gin Trp 
435 440 445 

5 

TTG GAG AAA GTA TTA ACT CAG ATG GGA TCC CCT TCA GTG CGT TGC TCA 13 92 

Leu Asp Lys Val Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser 
450 455 460 

10 AGC ATG TCA TGG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG 1440 
Ser Met Ser Trp Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met 
465 470 475 480 

GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 1488 
15 Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

485 490 495 

GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 1536 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 500 505 510 

GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC 1584 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 

515 520 525 

25 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 1632 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
530 535 540 

30 ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG 1680 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
545 550 555 560 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 1728 
35 His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

565 570 575 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 1776 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
40 580 585 590 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 1824 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
595 600 605 

45 

GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 1872 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
610 615 620 

50 TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 192 0 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
625 630 635 640 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 1968 
55 He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

645 650 655 
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Gil 11^ '^^'^ ^"""^ ATC GGC GAG GGC CCC 

Gin Leu Ala Asp Hxs .yr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 

665 

GTC CTG CTG CCC GAC AAC CAC TAC CTO AGC ACC GAG TCC GCC CTG AGC 
val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala lIu S^r 

680 685 

AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG 2112 
Lys ASP Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu vTl 



690 

700 



695 



710 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Met ser Ser He Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 

Gly Trp Lys Lys Ser Ala Gly Gly Ser Gl'y Gly Ala Gly Gly Gly Glu 

Gin Asn Gly Gin Glu Glu Lys Trp JJs Glu Lys Ala Val lys Ser Leu 

val Lys Lys Leu Lys Lys Thr lly Arg Leu Asp Glu llu Glu Lys Ala 

60 

lie Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr 

70 75 
cys ser Glu He Tp Gly Leu Ser Thr Pro Asn Thr He Asp Gin rip 



ASP Thr Thr Gly Leu Tyr Ser Phe Ser gL Gin Thr Arg Ser Leu Asp 



100 



Gly Arg Leu Gin Val Ser His Arg HI Gly Leu Pro His HI He Tyr 
11= 120 



cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 

140 

Ala He Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
cys val Asn Pro Tyr His Tyr Gin Arg Val III Thr Pro Val Leu Pro 
Pro val Leu Val Pro Arg His Thr Glu He Leu Thr Glu Leu Pro Pro 
Leu ASP Asp Tyr Thr His Ser He Pro Glu Asn Thr Asn Phe Pro Ala 



2016 



2064 



ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 
Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu T^r 
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195 200 205 

Gly lie Glu Pro Gin Ser Asn Tyr lie Pro Glu Thr Pro Pro Pro Gly 

210 215 220 

Tyr lie Ser Glu Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser 
5 225 230 235 240 

Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro 

245 250 255 

Val Asn His Ser Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala 
260 265 270 

10 Phe Trp Cys Ser lie Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu 
275 280 285 

Thr Phe His Ala Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp 

290 295 300 

Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn 
15 305 310 315 320 

Arg Asn Ala Thr Val Glu Met Thr Arg Arg His lie Gly Arg Gly Val 

325 330 335 

Arg Leu Tyr Tyr He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 
340 345 350 

20 Ser Ala He Phe Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp 
355 360 365 

His Pro Ala Thr Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys lie 

370 375 380 

Phe Asn Asn Gin Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin 
25 385 390 395 400 

Gly Phe Glu Ala Val Tyr Gin Leu Thr Arg Met Cys Thr He Arg Met 

405 410 415 

Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Thr 
420 425 430 

30 Ser Thr Pro Cys Trp He Glu Leu His Leu Asn Gly Pro Leu Gin Trp 
435 440 445 

Leu Asp Lys Val Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser 

450 455 460 

Ser Met Ser Trp Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met 
35 465 470 475 480 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

485 490 495 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
500 505 510 

40 Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
515 520 525 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

530 535 540 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
45 545 550 555 560 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

565 570 575 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
580 585 590 

50 Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
595 600 605 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

610 615 620 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
55 625 630 635 640 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 



171 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/0K98/OO145 



172 

Gin Leu Ala Asp nil ryr Gin OIn Aan ^h^ Pro He Gly Asp Gly Pro 

670 



^ Val Leu Leu Pro Asp Asn His Leu Ser Thr Gin Ser All Leu Ser 

l-ys ASP Pro Asn Glu Lys Arg Isp His Met Val Leu III oiu Phe Val 

Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu III xyr Lys 
) '^^^ 715 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 97 base pairs 

> (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 ... 2394 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

-7 2= s s ?s z - s z z I- ^ 



10 



15 



s s z ir. 2- ^. -I - „^ ^ 



40 45 



Su r r ^™ ^™ 

^ys Lys Asp Glu Leu Asp Ser Leu He Thr Ala He Thr Thr Asn 



55 60 



IT. S ^ IT. SI ?S T. S'.^^-r ^ 

SS ^■^^ Thr Leu Asp Gly 



80 



s m X ir, SI s s SI z iT. z ir. 



90 95 



CGT CTC TGG AGG TGG CCT GAT CTT rt^n tnatv 

.„ Z - - - - - »T 



105 XIO 



AAA TAT TGT CAG TAT GCG TTT GAC TTA AAA TGT GAT AGT GTC TOT GTO 



48 



96 



144 



192 



240 



266 



336 



384 
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Lys Tyr Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val 
115 120 125 



AAT CCA TAT CAC TAC GAA CGA GTT GTA TCA CCT GGA ATT GAT CTC TCA 432 
5 Asn Pro Tyr His Tyr Glu Arg Val Val Ser Pro Gly lie Asp Leu Ser 
130 135 140 



GGA TTA ACA CTG GAG AGT AAT GOT CCA TCA AGT ATG ATG GTG AAG GAT 4 80 

Gly Leu Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp 
10 145 150 155 160 

GAA TAT GTG CAT GAC TTT GAG GGA CAG CCA TCG TTG TCC ACT GAA GGA 52 B 

Glu Tyr Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly 
165 170 175 



15 



CAT TCA ATT CAA ACC ATC CAG CAT CCA CCA AGT AAT CGT GCA TCG ACA 576 
His Ser lie Gin Thr lie Gin His Pro Pro Ser Asn Arg Ala Ser Thr 
180 185 190 



20 GAG ACA TAC AGC ACC CCA GCT CTG TTA GCC CCA TCT GAG TCT AAT GCT 624 
Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala 
195 200 205 



ACC AGC ACT GCC AAC TTT CCC AAC ATT CCT GTG GCT TCC ACA AGT CAG 672 
25 Thr Ser Thr Ala Asn Phe Pro Asn lie Pro Val Ala Ser Thr Ser Gin 
210 215 220 



CCT GCC AGT ATA CTG GGG GGC AGC CAT AGT GAA GGA CTG TTG CAG ATA 720 
Pro Ala Ser lie Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He 
30 225 230 235 240 



GCA TCA GGG CCT CAG CCA GGA CAG CAG CAG AAT GGA TTT ACT GGT CAG 768 
Ala Ser Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin 
245 250 255 

35 

CCA GCT ACT TAC CAT CAT AAC AGC ACT ACC ACC TGG ACT GGA AGT AGG 816 
Pro Ala Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg 
260 265 270 



40 ACT GCA CCA TAC ACA CCT AAT TTG CCT CAC CAC CAA AAC GGC CAT CTT 864 
Thr Ala Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu 
275 280 285 



CAG CAC CAC CCG CCT ATG CCG CCC CAT CCC GGA CAT TAC TGG CCT GTT 912 
45 Gin His His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val 
290 295 300 

CAC AAT GAG CTT GCA TTC CAG CCT CCC ATT TCC AAT CAT CCT GCT CCT 960 
His Asn Glu Leu Ala Phe Gin Pro Pro He Ser Asn His Pro Ala Pro 
50 305 310 315 320 



GAG TAT TGG TGT TCC ATT GCT TAC TTT GAA ATG GAT GTT CAG GTA GGA 1008 
Glu Tyr Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly 
325 330 335 

55 

GAG ACA TTT AAG GTT CCT TCA AGC TGC CCT ATT GTT ACT GTT GAT GGA 1056 
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Glu Thr Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly 
"° 3« 350 

TAC GTG GAC CCT TCT GGA GGA GAT CGC TTT TCT TTG GGT CAA CTC TCC 
Tyr val Asp Pro Ser Gly Gly Asp Arg Phe Cys l.eu Gly Ser 

360 

AAT GTC CAC AGG ACA GAA GCC ATT GAG AGA GCA AGG TTG CAC ATA QGC 
Asn val Has Arg Thr Glu Ala He Glu Arg Ala Arg Zu His He Gly 

AAA GGT GTG CAG TTG GAA TGT AAA GGT GAA GGT GAT GTT TGG GTC AGG 
jys Gly val Gin .en Glu Cys Lys Gly Glu Gly Asp vll Sp vll Arg 

395 400 

TCC CTT AGT GAC CAC GCG GTC TTT GTA CAG AGT TAC TAC TTA GAC AGA 
cys Leu ser Asp His Ala Val Phe Val Gin Ser ryr Zr ^.u Asp Arg 
"5 410 

GAA GCT GGG CGT GCA CCT GGA GAT GCT GTT CAT AAG ATC TAC CCA AGT 
Glu Ala Gly Arg Ala Pro Gly Asp Ala Val His Lys He pS Ser 

"° 425 430 

A^^ Sr He f^s f ? ^ "° ATG CAG 

Ala Tyr He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin 

"5 440 

CAG CAG GCG GCT ACT GCA CAA GCT GCA GCA GCT GCC CAG GCA GCA GCC 
Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala aS Sa 

455 460 

GTC GCA GGA AAC ATC CCT GGC CCA GGA TCA GTA GGT GGA ATA GCT CCA 
val Ala Gly Asn He Pro Gly Pro Gly ser Val Gly Gly llj P^ 

475 480 

GCT ATC AGT CTC TCA GCT GCT GCT GGA ATT GGT GTT GAT GAC CTT CGT 
Ala He Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Lu S 
485 490 

CGC TTA TCC ATA CTC AGG ATC AGT TTT GTG AAA GGC TGG GGA CCG GAT 
Arg Leu Cys He Leu Arg Met Ser Phe Val Lys Gly T^ Gly P^ AsJ 

505 

TAC CCA AGA CAG AGC ATC AAA GAA ACA CCT TCC TCG ATT GAA ATT CAC 
Tyr Pro Arg Gin Ser He Lys Glu Thr Pro Cys Trp .7. Ss 



520 



525 



TTA CAC CGG GCC CTC CAG CTC CTA GAC GAA GTA CTT CAT ACC ATG CCG 
Leu Hxs Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His ilr Pro 

540 

?n IT. So Tn r 

lie Ala Asp Pro Gin Pro Leu Asp Trp Asp Pro Pro Val Ala Thr Met 

555 

GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 
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Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
565 570 575 

GAG CTG GAG GGC GAG GTA AAC GGC CAC AAG TTC AGO GTG TCC GGC GAG 17 76 

5 Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
580 585 590 

GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC TGC 1824 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
10 595 600 605 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 1872 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

610 615 620 

15 

ACC TAC GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG CAG 1920 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 

625 630 635 640 

20 CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 1968 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
645 650 655 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 2016 
25 Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
660 665 670 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 2064 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
30 675 680 685 

GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 2112 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 

690 695 700 

35 

TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 2160 

Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
705 710 715 720 

40 ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 2208 
lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val 
725 730 735 

CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC 2256 
45 Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
740 745 750 

GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC 2304 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
50 755 760 765 

AAA GAC CCC AAC GAG PiAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG 23 52 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
770 775 780 



55 



ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 23 97 
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176 

Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

795 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Met ASP Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys 

10 

X-eu ser lie Val His Ser Leu Met Cys His Arg oXn Cly oly llu Ser 

Glu X^r P^e Ala Lys Arg Ala lie Olu Ser Leu Val Lys lys Leu Lys 

Glu Lys Lys Asp Glu Leu Asp Ser Leu He Thr Ala Thr Thr Asn 

Gly Ala His Pro Ser Lys Cys Val Thr He Gin Thr Leu Asp Gly 

Arg Leu Gin Val Ala Gly Arg Lys Gly Phe Pro His Val He Tyr II 

Arg Leu Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys Jis Val 

I.ys Tyr cys Gin Tyr Ala Phe Asp III Lys Cys Asp Ser III Cys Val 

Asn Pro Tyr His Xyr Glu Arg HI val ser P.o Gly He Asp Leu Ser 

Gly Leu Thr Leu Oln Ser Asn Ala Pro Ser Ser "^tl „et Val Lys Asp 

Glu Tyr val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gl" 

170 

His ser He Gin Thr He Gin His Pro Pro Ser Asn Arg Ala III Thr 



185 



Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro ser Glu III Asn Ala 



200 



Thr ser Thr Ala Asn Phe Pro Asn He Pro Val Ala S^r Thr Ser Gin 



215 



Pro Ala ser He Leu Gly oi^ ser His Ser Glu lly Leu Leu Gin He 



230 



Ala ser Gly Pro Gin Pro Gly Gin Gin Gin III Gly Phe Thr Gly Itl 



245 



pro Ala Thr Tyr His His Asn Ser Thr ^hr Thr Trp Thr Gly Ser Arg 
«xr Ala Pro .yr Thr Pro Asn Leu III His His Gin Asn lly His Leu 
Gin His His Pro Pro Met Pro Pro" His Pro Gly His '^l p^o Val 

His Asn Glu Leu Ala Phe Gin Pro Pro He Ser III His Pro Ala Pro 



17 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 PCT/DK98/001 45 

177 

305 310 315 320 

Glu Tyr Trp Cys Ser lie Ala Tyr Phe Glu Met Asp Val Gin Val Gly 

325 330 335 

Glu Thr Phe Lys Val Pro Ser Ser Cys Pro lie Val Thr Val Asp Gly 
5 340 345 350 

Tyr Val Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser 

355 360 365 

Asn Val His Arg Thr Glu Ala lie Glu Arg Ala Arg Leu His lie Gly 
370 375 380 

10 Lys Gly Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg 
385 390 395 400 

Cys Leu Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg 

405 410 415 

Glu Ala Gly Arg Ala Pro Gly Asp Ala Val His Lys lie Tyr Pro Ser 
15 420 425 430 

Ala Tyr lie Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin 

435 440 445 

Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala 
450 455 460 

20 Val Ala Gly Asn lie Pro Gly Pro Gly Ser Val Gly Gly lie Ala Pro 
465 470 475 480 

Ala lie ser Leu Ser Ala Ala Ala Gly lie Gly Val Asp Asp Leu Arg 

485 490 495 

Arg Leu Cys lie Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp 
25 500 505 510 

Tyr Pro Arg Gin Ser lie Lys Glu Thr Pro Cys Trp lie Glu lie His 

515 520 525 

Leu His Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Net Pro 
530 535 540 

30 lie Ala Asp Pro Gin Pro Leu Asp Trp Asp Pro Pro Val Ala Thr Met 
545 550 555 560 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

565 570 575 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
35 580 585 590 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 

595 600 605 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
610 615 620 

40 Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
625 630 635 640 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

645 650 655 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
45 660 665 670 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 

675 680 685 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
690 695 700 

50 Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
705 710 715 720 

lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val 

725 730 735 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
65 740 745 750 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
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"^55 j^- 
Uys ASP Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

'^'^^ 780 
Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
790 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 1...3135 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

111 r?"" l"""" T ""^^ <5AC GCG CTG CGC 

Met Ala Gly Trp lie Gin Ala Gin Gin Leu Gin Gly Asp Ala llu Arg 

5 10 15 

CAG ATG GAG GTG CTG TAC GGC CAG CAC TTC CCC ATC GAG GTC CGG CAC 
Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro He Tu vll Sg Ss 
20 25 30 

TAG TTG GCC CAG TGG ATT GAG AGC CAG CCA TGG GAT GCC ATT GAC TTG 
Tyr Leu Ala Gin Trp He Glu Ser Gin Pro Trp Asp 111 S ^p Uu 

40 45 

GAC AAT CCC CAG GAC AGA GCC CAA GCC ACC CAG CTC CTG GAG GGC CTG 
ASP Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin Leu leu Su G^y ITu 

60 

Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin Val Gly Glu Asp Gly 



75 



SO 



TTT TTA CTO AAG ATC AAG CTG GGG CAC TAC GCC ACG CAG CTC CAG AAA 
Phe Leu Leu Lys lie Lys Leu Gly His Tyr Ala Thr Gin Leu Sn ^ 



?hr 'Zl f ''^'^ "° '^'^^ ^™ CGC TGC ATC CGG CAC ATT 

Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg Cys He Arg H^s S 



105 



110 



CTG TAC AAT GAA CAG AGG CTG GTC CGA GAA GCC AAC AAT TGC AGC TCT 
Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala Asn Asn Cys Ser Ser 

120 125 

CCG GCT GGG ATC CTG GTT GAC GCC ATG TCC CAG AAG CAC CTT CAG ATC 



48 



96 



144 



192 



240 



288 



336 



384 



432 
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Pro Ala Gly lie Leu Val Asp Ala Met Ser Gin Lys His Leu Gin lie 
130 135 140 

AAC CAG ACA TTT GAG GAG CTG CGA CTG GTC ACQ CAG GAC ACA GAG AAT 4 BO 

5 Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr Gin Asp Thr Glu Asn 
145 150 155 160 

GAG CTG AAG AAA CTG CAG CAG ACT CAG GAG TAC TTC ATC ATC CAG TAG 52 8 

Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr Phe He He Gin Tyr 
10 165 170 175 

CAG GAG AGC CTG AGG ATC CAA GCT CAG TTT GCC CAG CTG GCC CAG CTG 576 

Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala Gin Leu Ala Gin Leu 

180 185 190 

15 

AGC CCC CAG GAG CGT CTG AGC CGG GAG ACG GCC CTC CAG CAG AAG CAG 624 

Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala Leu Gin Gin Lys Gin 
195 200 205 

20 GTG TCT CTG GAG GCC TGG TTG CAG CGT GAG GCA CAG ACA CTG CAG CAG 672 

Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala Gin Thr Leu Gin Gin 
210 215 220 

TAC CGC GTG GAG CTG GCC GAG AAG CAC CAG AAG ACC CTG CAG CTG CTG 720 
25 Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys Thr Leu Gin Leu Leu 
225 230 235 240 

CGG AAG CAG CAG ACC ATC ATC CTG GAT GAC GAG CTG ATC CAG TGG AAG 768 
Arg Lys Gin Gin Thr He He Leu Asp Asp Glu Leu He Gin Trp Lys 
30 245 250 255 

CGG CGG CAG CAG CTG GCC GGG AAC GGC GGG CCC CCC GAG GGC AGC CTG 816 

Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro Pro Glu Gly Ser Leu 
260 265 270 

35 

GAC GTG CTA CAG TCC TGG TGT GAG AAG TTG GCC GAG ATC ATC TGG CAG 864 

Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala Glu He He Trp Gin 
275 280 285 

40 AAC CGG CAG CAG ATC CGC AGG GCT GAG CAC CTC TGC CAG CAG CTG CCC 912 
Asn Arg Gin Gin He Arg Arg Ala Glu His Leu Cys Gin Gin Leu Pro 
290 295 300 

ATC CCC GGC CCA GTG GAG GAG ATG CTG GCC GAG GTC AAC GCC ACC ATC 960 
45 He Pro Gly Pro Val Glu Glu Met Leu Ala Glu Val Asn Ala Thr He 
305 310 315 320 

ACG GAC ATT ATC TCA GCC CTG GTG ACC AGC ACA TTC ATC ATT GAG AAG 1008 
Thr Asp He He Ser Ala Leu Val Thr Ser Thr Phe He He Glu Lys 
50 325 330 335 

CAG CCT CCT CAG GTC CTG AAG ACC CAG ACC AAG TTT GCA GCC ACC GTA 1056 
Gin Pro Pro Gin Val Leu Lys Thr Gin Thr Lys Phe Ala Ala Thr Val 
340 345 350 



55 



CGC CTG CTG GTG GGC GGG AAG CTG AAC GTG CAC ATG AAT CCC CCC CAG 1104 
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Arg Leu Leu Val Gly Gly Lys Leu Asn Val His Met Asn Pro Pro Gin 

365 

ill ^ s s s s s s= s s s 

^« m""^ ^""^ '^^'^ AGT GCC CAC TTC 

Cys val Met Glu Tyr His Gin Ala Thr Gly Thr Leu Ser His JJe 

"5 410 

s ^^n^ iTr z si r r 

425 

460 

?I? ;il ;il ^ ^^•^ ^^-^ GCC ACG GOT 

Pro val val Val He Val His Gly Ser Gin Asp His Asn Ala Thr All 

475 

vl? "^"^ ^"^"^ °AG CCG GGC AGG GTG CCA TTT 

Thr val z,eu Trp Asp Asn Ala Phe Ala Glu Pro Gly Arg vl? pH 



490 



495 



17. Z ^ S 5= ^, 



505 



510 



Sit r^t aS g?; ?i? r '''' 

515 ^^"^ ^^'^ ^9 Leu Thr Lys Glu 

520 



525 



?ii pI; 12 aS Sn r r° 

530 ^^"^ A«» ser Ser His 



535 540 



555 560 

570 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 
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Val Met Glu Val Leu Lys Lys His His Lys Pro His Trp Asn Asp Gly 
580 585 590 

GCC ATC CTA GGT TTT GTG AAT AAG CAA CAG GCC CAC GAC CTG CTC ATC 1824 
5 Ala lie Leu Gly Phe Val Asn Lys Gin Gin Ala His Asp Leu Leu He 
595 600 605 

AAC AAG CCC GAC GGG ACC TTC TTG TTG CGC TTT AGT GAC TCA GAA ATC 1872 
Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe Ser Asp Ser Glu He 
10 610 615 620 

GGG GGC ATC ACC ATC GCC TGG AAG TTT GAC TCC CCG GAA CGC AAC CTG 1920 

Gly Gly He Thr He Ala Trp Lys Phe Asp Ser Pro Glu Arg Asn Leu 
625 630 635 640 

15 

TGG AAC CTG AAA CCA TTC ACC ACG CGG GAT TTC TCC ATC AGO TCC CTG 1968 

Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe Ser He Arg Ser Leu 
645 650 655 

20 GCT GAC CGG CTG GGG GAC CTG AGC TAT CTC ATC TAT GTG TTT CCT GAC 2016 
Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu He Tyr Val Phe Pro Asp 
660 665 670 

CGC CCC AAG GAT GAG GTG TTC TCC AAG TAG TAG ACT CCT GTG CTG GCT 2 064 

25 Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr Thr Pro Val Leu Ala 
675 680 685 

AAA GCT GTT GAT GGA TAT GTG AAA CCA CAG ATC AAG CAA GTG GTC CCT 2112 
Lys Ala Val Asp Gly Tyr Val Lys Pro Gin He Lys Gin Val Val Pro 
30 690 695 700 

GAG TTT GTG AAT GCA TCT GCA GAT GCT GGG GGC AGC AGC GCC ACG TAG 2160 

Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly Ser Ser Ala Thr Tyr 

705 710 715 720 

35 

ATG GAC CAG GCC CCC TCC CCA GCT GTG TGC CCC CAG GCT CCC TAT AAC 2208 

Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro Gin Ala Pro Tyr Asn 
725 730 735 

40 ATG TAC CCA CAG AAC CCT GAC CAT GTA CTC GAT CAG GAT GGA GAA TTC 2256 

Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp Gin Asp Gly Glu Phe 
740 745 750 

GAC CTG GAT GAG ACC ATG GAT GTG GCC AGG CAC GTG GAG GAA CTC TTA 2304 
45 Asp Leu Asp Glu Thr Met Asp Val Ala Arg His Val Glu Glu Leu Leu 
755 760 765 

CGC CGA CCA ATG GAC AGT CTT GAC TCC CGC CTC TCG CCC CCT GCC GGT 23 52 

Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu Ser Pro Pro Ala Gly 
50 770 775 780 

CTT TTC ACC TCT GCC AGA GGC TCC CTC TCA TGG GTA CCG CGG GCC CGG 24 00 

Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser Trp Val Pro Arg Ala Arg 
785 790 795 800 



55 



GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC 244 8 
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ASP Pro Pro val Ala Thr Met Val Ser Lys Gly Glu GIu Leu Phe Thr 



810 



825 



830 



845 



855 



860 



880 

Pro SJs r ^^'^ ^-^^ ^« ^-^^ ^-^^ CCC 

Tyr Pro Asp Hxs Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

88S 890 895 

^ s= - ^ 

905 

TAC AAG ACC CGC GCC GAG GTG AA6 TTC GAG GGC GAC ACC CTG GTG AAr> 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Sp rlr Su Hi 

920 

S Jle C?u f ° of ''^^ ^° °*<= ATC CTG 

Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 

GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG 
Gly H.S Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn VaJ Jyr He Set 

955 

S Z ^ X ^ Z S - s Z ^. ill - - 



970 



975 



He G?u '''''' TAC CAG GAG AAC 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin J^n 

985 

P^"" '''''' ^^"^ CTG CCC GAC AAC CAC TAC CTG 

Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn ^ 

1000 1005 

ser rll m n f f ''^^ ^C'^ ^C AAG CGC GAT CAC 

Ser^Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 

■^"•^5 1020 
ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG 
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2496 



2544 



TGG 2592 



2640 



2688 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 

182 
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Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met 
1025 1030 1035 1040 

GAC GAG CTG TAC AAG TAA 313 8 

Asp Glu Leu Tyr Lys 
1045 



(2) INFORMATION FOR SEQ ID NO: 79: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 



25 



30 



35 



40 



45 



50 



55 



Met 


Ala 


Gly 


Trp 


He 


Gin 


Ala 


Gin 


Gin 


Leu 


Gin 


Gly Asp Ala 


Leu 


Arg 


1 








5 










10 










15 




Gin 


Met 


Gin 


Val 


Leu 


Tyr 


Gly 


Gin 


His 


Phe 


Pro 


He 


Glu 


Val 


Arg 


His 








20 










25 










30 






Tyr 


Leu 


Ala 


Gin 


Trp 


He 


Glu 


Ser 


Gin 


Pro 


Trp 


Asp 


Ala 


He 


Asp 


Leu 






35 










40 










45 








Asp 


Asn 


Pro 


Gin 


Asp 


Arg 


Ala 


Gin 


Ala 


Thr 


Gin 


Leu- 


Leu Glu Gly 


Leu 




50 










55 










60 










Val 


Gin 


Glu 


Leu 


Gin 


Lys 


Lys 


Ala 


Glu 


His 


Gin 


Val 


Gly 


Glu 


Asp 


Gly 


65 










70 










75 










80 


Phe 


Leu 


Leu 


Lys 


He 


Lys 


Leu 


Gly 


His 


Tyr 


Ala 


Thr 


Gin 


Leu 


Gin 


Lys 










85 










90 










95 




Thr 


Tyr 


Asp 


Arg 


Cys 


Pro 


Leu 


Glu 


Leu 


Val 


Arg 


Cys 


He 


Arg 


His 


He 








100 










105 










110 






Leu 


Tyr 


Asn 


Glu 


Gin 


Arg 


Leu .Val 


Arg 


Glu 


Ala 


Asn 


Asn 


Cys 


Ser 


Ser 






115 










120 










125 








Pro 


Ala 


Gly 


He 


Leu 


Val 


Asp 


Ala 


Met 


Ser 


Gin 


Lys 


His 


Leu 


Gin 


He 




13 0 










135 










140 










Asn 


Gin 


Thr 


Phe 


Glu 


Glu 


Leu 


Arg 


Leu 


Val 


Thr 


Gin 


Asp 


Thr 


Glu 


Asn 


145 










150 










155 










160 


Glu 


Leu 


Lys 


Lys 


Leu 


Gin 


Gin 


Thr 


Gin 


Glu 


Tyr 


Phe 


He 


He 


Gin 


Tyr 










165 










170 










175 




Gin 


Glu 


Ser 


Leu 


Arg 


He 


Gin 


Ala 


Gin 


Phe 


Ala 


Gin 


Leu 


Ala 


Gin 


Leu 








180 










185 










190 






Ser 


Pro 


Gin 


Glu 


Arg 


Leu 


Ser 


Arg 


Glu 


Thr 


Ala 


Leu 


Gin 


Gin 


Lys 


Gin 






195 










200 










205 








Val 


Ser 


Leu 


Glu 


Ala 


Trp 


Leu 


Gin 


Arg 


Glu 


Ala 


Gin 


Thr 


Leu 


Gin 


Gin 




210 










215 










220 










Tyr 


Arg 


Val 


Glu 


Leu 


Ala 


Glu 


Lys 


His 


Gin 


Lys 


Thr 


Leu 


Gin 


Leu 


Leu 


225 










230 










235 










240 


Arg 


Lys 


Gin 


Gin 


Thr 


He 


He 


Leu 


Asp 


Asp 


Glu 


Leu 


He 


Gin 


Trp 


Lys 










245 










250 










255 




Arg 


Arg 


Gin 


Gin 


Leu 


Ala 


Gly 


Asn 


Gly 


Gly 


Pro 


Pro 


Glu Gly 


Ser 


Leu 








260 










265 










270 






Asp 


Val 


Leu 


Gin 


Ser 


Trp 


Cys 


Glu 


Lys 


Leu 


Ala 


Glu 


He 


He 


Trp 


Gin 



183 
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275 



280 



Asn Arg Gin Gin Zle Arg Arg HI gIu His Leu Cys Gl" Gin .eu Pro 



295 



lie Pro Gly Pro Val oiu Olu Met Leu Ala Glu III Asn Ala Thr He 



305 



310 



Thr ASP He lie Ser Al'a Leu Val Thr Ser Phe He Xle Glu ly. 

330 

cm Pro pro Gin Val Leu Lys Thr Gin Thr Lys Phe Ala Ala Thr Val 



.0 Arg Leu Leu Vai Gly Gly Lys Leu 12 Val His Met Asn Pro Pro Gin 

360 

val Lys Ala Thr He He Ser Glu Gin Gin Ala Lys Ser Leu Leu Lys 



3" 



Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu He Leu Asn Asn Cys 



cys val Met Glu Ty. His Gin Ala Thr Gly Thr Leu Ser Ala His Z 



400 



405 



Arg Asn Met Ser Leu Lys Ara Tie t^o a™ m * ^'^^ 

420 ^ ^ ^""^ ^""^ Ala 

0 Glu Ser Val Thr Glu Glu Lys Phe Thr- v^i t ^ . 

oxu iiys Pfte Thr Val Leu Phe Glu Ser Gin Phe 



440 



ser val Gly Ser Asn Glu Leu Val Phe Gin Val Lys Thr Leu Ser Leu 

460 

pro val val Val He Val His Gly Ser Gin Asp His Asn Ala Thr Ala 

47 ^ 

Thr val Leu Trp Asp Asn Ala Phe Ala Glu Pro Gly Arg Val Pro III 

490 

Ala val Pro Asp Lys Val Leu Trp Pro Gin Leu Cys Glu Ala til Asn 

Met Lys Phe Lys Ala Glu Val Gin ler Asn Arg Gly Leu T^r Lys Glu 

Asn Leu val Phe Leu Ala Gin Lys Leu Phe Asn Asn III Ser Ser His 

Leu Glu ASP Tyr Ser Gly III ser Val Ser Trp III ain Phe Asn Arg 

Glu Asn Leu Pro Gly Trp Asn ^r Thr Phe t" Gin Trp Phe Asp g" 

val Met Glu val Leu Lys Lys His His Lys Pro His Trp Asn Gly 

Ala He Leu Gly Phe Val Asn Lys III Gin Ala His Asp Le^ Leu He 

600 fin*; 
Asn Lys pro Asp Gly «xr Phe Leu Leu Arg Phe Ser Asp Ser Glu He 

615 620 
Gly Gly He Thr He Ala Trp Lys Phe Asp Ser Pro Glu Arg Asn Leu 

o J 0 



635 



Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp ;h: Ser He Arg Ser "u 



645 



Ala ASP Arg Leu Gly Asp Leu Ser ^r Leu He Tyr Val Phe Pro Asp 
Arg Pro Lys Asp Glu Val Phe Ser lyl Tyr Tyr Thr Pro til Leu Ala 



Lys Ala val Asp Gly Tyr Val L^^ Pro Gin He Lys Gin Val Val Pro 

695 700 
^lu Phe val Asn Ala Ser Ala Asp Ala Gly Gly Ser Ser Ala Thr lyr 

7 15 

Met ASP Gin Ala Pro Ser Pro Ala Val Cys Pro Gin Ala Pro Tyr III 



184 
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725 730 735 





Met 


Tyr 


Pro 


Gin 
740 


Asn 


Pro 


Asp 


His 


Val 
745 


Leu 


Asp 


Gin 


Asp 


Gly 
750 


Glu 


Phe 




Asp 


Leu 


Asp 


Glu 


Thr 


Met 


Asp 


Val 


Ala 


Arg 


His 


Val 


Glu 


Glu 


Leu 


Leu 


5 






755 










760 










765 










Arg 


Arg 


Pro 


Met 


Asp 


Ser 


Leu 


Asp 


Ser Arg 


Leu 


Ser 


Pro 


Pro 


Ala 


Gly 






770 










775 










780 












Leu 


Phe 


Thr 


Ser 


Ala 


Arg 


Gly 


Ser 


Leu 


Ser 


Trp 


Val 


Pro 


Arg 


Ala 


Arg 




785 










790 










795 










800 


10 


Asp 


Pro 


Pro 


Val 


Ala 
805 


Thr 


Met 


Val 


Ser 


Lys 
810 


Gly 


Glu 


Glu 


Leu 


Phe 
815 


Thr 




Gly 


Val 


Val 


Pro 


He 


Leu 


Val 


Glu 


Leu 


Asp 


Gly Asp 


Val 


Asn 


Gly 


His 










820 










825 










830 








Lys 


Phe 


Ser 


Val 


Ser 


Gly 


Glu 


Gly Glu 


Gly Asp 


Ala 


Thr 


Tyr Gly 


Lys 


15 






835 










840 










845 










Leu 


Thr 
850 


Leu 


Lys 


Phe 


He 


Cys 
855 


Thr 


Thr 


Gly 


Lys 


Leu 
860 


Pro 


Val 


Pro 


Trp 




Pro 


Thr 


Leu 


Val 


Thr 


Thr 


Leu 


Thr Tyr Gly Val 


Gin 


Cys 


Phe 


Ser 


Arg 




865 










870 










875 










880 


20 


Tyr 


Pro 


Asp 


His 


Met 
885 


Lys 


Gin 


His 


Asp 


Phe 
890 


Phe 


Lys 


Ser 


Ala 


Met 
695 


Pro 




Glu 


Gly 


Tyr 


Val 
900 


Gin 


Glu 


Arg 


Thr 


He 
905 


Phe 


Phe 


Lys 


Asp 


Asp 
910 


Gly 


Asn 




Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


Val 


Lys 


Phe 


Glu 


Gly Asp 


Thr 


Leu 


Val 


Asn 


25 






915 










920 










925 










Arg 


He 
930 


Glu 


Leu 


Lys 


Gly 


He 
935 


Asp 


Phe 


Lys 


Glu 


Asp 
940 


Gly 


Asn 


He 


Leu 




Gly 


His 


Lys 


Leu 


Glu 


Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 




945 










950 










955 










960 


30 


Ala 


Asp 


Lys 


Gin 


Lys 
965 


Asn 


Gly 


He 


Lys 


Val 
970 


Asn 


Phe 


Lys 


He 


Arg 
975 


His 




Asn 


He 


Glu 


Asp 
980 


Gly 


Ser 


Val 


Gin 


Leu 
985 


Ala 


Asp 


His 


Tyr 


Gin 
990 


Gin 


Asn 




Thr 


Pro 


He 


Gly 


Asp 


Gly 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


35 






995 








1000 








1005 










Ser 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 




1010 








1015 








1020 












Met 


Val 


Leu 


Leu 


Glu 


Phe 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 


Met 



025 1030 1035 1040 

40 Asp Glu Leu Tyr Lys 



1045 



(2) INFORMATION FOR SEQ ID NO: 80: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TGGGATCCTC AGGCCGTGCT GCTGGCCG 28 

55 

(2) INFORMATION FOR SEQ ID NO: 81: 
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186 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GTCTCGAGGG AGCATGGGCA CCTTGCG 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
TGGGATCCGA GAAGTCTATA TCCCATC 

27 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:83: 
TGGGATCCTT AGAAGTCTAT ATCCCATC 

28 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GTCTCGAGCC ATGAACGCCC CCGAGCGG 

28 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

186 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GTGAATTCTC GTCTGATTTC TGGCAGGAGG 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



20 



25 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GTGAATTCTT TACGTCTGAT TTCTGGCAGG 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GTCTCGAGCC ATGGACGAAC TGTTCCCCCT CATC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



34 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GTGGATCCAA GGAGCTGATC TGACTCAGCA G 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



31 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GTGGATCCTT AGGAGCTGAT CTGACTCAGC AG 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 90 : 
CCTCCTAAGC TTATCATGGA CCATTATGAT TC 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 91 : 
CCTCCTGGAT CCCTGCGCAG GATGATGGTC CAG 

(2) INFORMATION FOR SEQ ID NO : 92 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GGATGGAAGC TTCAATGGCT GCCATCCGGA AGAAACTGGT GATTG 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GGATG6GGAT CCTCACAAGA CAAGGCAACC AGATTTTTrC TTCCC 



33 



45 



45 
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(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GGGAAGCTTC CATGAGCGAG ACGGTCATC 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



29 



25 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

CCCGGATCCT CAGGGAGAAC CCCGCTTC 

<2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTGAATTCGA CCATGGAGCG GCCCCCGGGG 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



28 



30 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GTGGTACCCA TTCTGTTAAC CAACTCC 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 8 base pairs 



27 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GTGGTACCTC ATTCTGTTAA CCAACTCC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTCTCGAGAG ATGCTGTCCC GTGGGTGG 

(2) INFORMATION FOR SEQ ID NO: 10 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GTGAATTCGC TTCCTCTTGA GGGAACC 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
GTGAATTCAC TTCCTCTTGA GGGAACC 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 
GTCTCGAGCC ATGGAGAACT TCCAAAAGG 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



29 



15 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GTGGATCCCA GAGTCGAAGA TGGGGTAC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



28 



30 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 
GTGGATCCTC AGAGTCGAAG ATGGGGTAC 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GTGAATTCGG CGATGCCAGA CCCCGCGGCG 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



29 



30 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GTGGATCCCA GGCACAGGCA GCCTCAGCCT TC 



32 



191 
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(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GTGGATCCTC AGGCACAGGC AGCCTCAGCC TTC 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2613 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

mII '''''' ''''^ "''^ GTG GTG CCC ATC CTG 

Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val VaJ So Tl. 

^ 10 



15 



s s z s z s z s= s ^. s s ;p s s 
s° z s z z s s - ™ 2^ s;^ s 

40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC CTC 2ir-n T^r^n 

Cys Thr Thr Gly x.ys Leu Pro Val Pro Trp Pro Jhr ."u "It ^hr 

60 

s ii^ s J- s z 

70 

S° St; S= J,*: ^ °- J- ^ - - - c« 1 

^ Gl" <31y Tyr Val Gin Glu 

85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 



33 



48 



96 



144 



192 



240 



288 



336 

192 
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Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

5 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

10 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

15 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

20 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 

25 Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

30 210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

35 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG GCG ATG CCA GAC CCC 768 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Met Pro Asp Pro 
245 250 255 

40 GCG GCG CAC CTG CCC TTC TTC TAC GGC AGC ATC TCG CGT GCC GAG GCC 816 

Ala Ala His Leu Pro Phe Phe Tyr Gly Ser He Ser Arg Ala Glu Ala 
260 265 270 

GAG GAG CAC CTG AAG CTG GCG GGC ATG GCG GAC GGG CTC TTC CTG CTG 864 

45 Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly Leu Phe LeiU Leu 
275 280 285 

CGC CAG TGC CTG CGC TCG CTG GGC GGC TAT GTG CTG TCG CTC GTG CAC 912 

Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu Ser Leu Val His 

50 290 295 300 

GAT GTG CGC TTC CAC CAC TTT CCC ATC GAG CGC CAG CTC AAC GGC ACC 960 

Asp Val Arg Phe His His Phe Pro He Glu Arg Gin Leu Asn Gly Thr 
305 310 315 320 



55 



TAC GCC ATT GCC GGC GGC AAA GCG CAC TGT GGA CCG GCA GAG CTC TGC 1008 

193 
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40 



45 



50 



rrr „. „^ ^^^^^ 

,„ s s= JJJ s= s s s s s= s s 5 2 

365 

s c s ^ - - - - - ..c 

370 ^ Arg Gin Thr Trp Lys 

15 380 

c°t° s - s s ji: s r - 

385 " ^-^^ He Ser Gin Ala Pro Gin 

395 

on 

GTG GAG AAG CTC ATT CPT ar-r- ^ 

v.. o.. ... „ SI s s s s= s s 
« s x= s ?s s s= ti: X - - - - ^ 

420 '^^'^ 

^ X i^p^ 21 s r r -° -° 

30 435 ^ ^ Arg Lys Glu Gin Gly 

445 

?S ^Jr' Sa r 

450 ''^'^ ^ Thr Val Tyr His Tyr Leu 

J5 "^'^ 460 

"e ser X Asp ^ JJs 

465 ^ ''^^ Tyr Cys He Pro Glu Gly Thr Lys 

480 

SI s= s 2 s s 2^ ir. X r r ^ "° °- 

u vai Glu Tyr Leu Lys Leu Lys Ala Asp 

S s s= ^ ir. s.^ r r - 

500 Ser Ser Ala Ser 

510 

AAC GCC TCA GGG GCT GCT GCT rrr ana 

- «a „. S - ™ SI SI S III S 

HI S I?I III S ^ - - - - - I ..c 

530 ^ ™^ Leu Asn Ser Asp Gly Tyr 



1056 



55 535 540 

ACC CCT <«C CO. =C 0=0 »c= r=c cc, ^ cc. a» cc= 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1680 

194 
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Thr Pro Glu Pro Ala Arg lie Thr Ser Pro Asp Lys Pro Arg Pro Met 
545 550 555 560 

CCC ATG GAC ACG AGC GTG TAT GAG AGC CCC TAG AGO GAC CCA GAG GAG 1728 
5 Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser Asp Pro Glu Glu 

565 570 575 

CTC AAG GAC AAG AAG CTC TTC CTG AAG CGC GAT AAC CTC CTC ATA GCT 1776 
Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn Leu Leu He Ala 
10 580 585 590 

GAC ATT GAA CTT GGC TGC GGC AAC TTT GGC TCA GTG CGC CAG GGC GTG 1824 

Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val Arg Gin Gly Val 
595 600 605 

15 

TAC CGC ATG CGC AAG AAG CAG ATC GAC GTG GCC ATC AAG GTG CTG AAG 1872 

Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He Lys Val Leu Lys 
610 615 620 

20 CAG GGC ACG GAG AAG GCA GAC ACG GAA GAG ATG ATG CGC GAG GCG CAG 1920 
Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met Arg Glu Ala Gin 
625 630 635 640 

ATC ATG CAC CAG CTG GAC AAC CCC TAC ATC GTG CGG CTC ATT GGC GTC 1968 
25 He Met His Gin Leu Asp Asn Pro Tyr He Val Arg Leu He Gly Val 

645 650 655 

TGC CAG GCC GAG GCC CTC ATG CTG GTC ATG GAG ATG GCT GGG GGC GGG 2 016 

Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met Ala Gly Gly Gly 
30 660 665 670 

CCG CTG CAC AAG TTC CTG GTC GGC AAG AGG GAG GAG ATC CCT GTG AGC 2064 

Pro Leu His Lys Phe Leu Val Gly Lys Arg Glu Glu He Pro Val Ser 
675 680 685 

35 

AAT GTG GCC GAG CTG CTG CAC CAG GTG TCC ATG GGG ATG AAG TAC CTG 2112 

Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly Met Lys Tyr Leu 
690 695 700 

40 GAG GAG AAG AAC TTT GTG CAC CGT GAC CTG GCG GCC CGC AAC GTC CTG 2160 
Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 
705 710 715 720 

CTG GTT AAC CGG CAC TAC GCC AAG ATC AGC GAC TTT GGC CTC TCC AAA 2208 
45 Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe Gly Leu Ser Lys 

725 730 735 

GCA CTG GGT GCC GAC GAC AGC TAC TAC ACT GCC CGC TCA GCA GGG AAG 2256 
Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg Ser Ala Gly Lys 
50 740 745 750 

TGG CCG CTC AAG TGG TAC GCA CCC GAA TGC ATC AAC TTC CGC AAG TTC 2304 
Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn Phe Arg Lys Phe 
755 760 765 



55 



TCC AGC CGC AGC GAT GTC TGG AGC TAT GGG GTC ACC ATG TGG GAG GCC 2352 

195 
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Ser Ser Arg Ser Asp VaX Trp Ser Tyr Gly Val Thr Met Trp Glu Ala 



780 



TTG TCC TAC GGC CAG AAG CCC TAC AAG AAG ATG AAA GGG CCG GAG GTC 
I-au ser Tyr Gly Gin Lys Pro ^lyr X,ys Lys Met Lys Gly Pro Su Va^ 

Sit A^a pJe ^'^^ TGT 

Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys Pro Pro Glu Cys 

805 810 815 

CCA CCC GAA CTG TAC GCA CTC ATG AGT GAG TGC TGG ATC TAC AAG TGG 
Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp il^ ^r ^s 

820 one XT 

825 830 

GAG GAT CGC CCC GAC TTC CTG ACC GTG GAG CAG CGC ATG CGA GCC TCT 
Glu ASP Arg Pro Asp Phe Leu Thr Val Glu Gin Arg Met Arg 

840 845 

Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro Gly Ser Thr Gin 

8gQ 

AAG GCT GAG GCT GCC TGT GCC TGA 

Lys Ala Glu Ala Ala Cys Ala ^^-^^ 



865 



870 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

val Glu Leu Asp Gly Asp Val Asn Gly Lys Phe Ser Val 5er Gly 

Glu Gly Glu Gly Asp Ala Thr Tyr G^y Lys Leu Thr Leu Lys Phe He 

cys Thr Thr Gly Lys Leu Pro ial Pro Trp Pro Thr llu Val Thr Thr 

^ ^ 60 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

75 80 



Gin His ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin llu 

90 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg IL Glu 
val Lys Phe Glu Gly Asp Thr Leu HI Asn Arg He Glu llu Lys Gly 



2400 



2448 



2496 



2544 



2592 
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115 120 125 





lie 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 






130 










135 










140 












Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


5 


145 










150 










155 










160 




Gly 


lie 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 












165 










170 










175 






Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 










180 










185 










190 








Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 








195 










200 










205 










Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 






210 










215 










220 












Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 


15 


225 










230 










235 










240 




Gly 


Leu 


Arg 


Ser 


Arg 


Ala 


Gin 


Ala 


Ser 


Asn 


Ser 


Ala 


Met 


Pro 


Asp 


Pro 












245 










250 










255 






Ala 


Ala 


His 


Leu 


Pro 


Phe 


Phe 


Tyr 


Gly 


Ser 


He 


Ser 


Arg 


Ala 


Glu 


Ala 










260 










265 










270 






20 


Glu 


Glu 


His 


Leu 


Lys 


Leu 


Ala 


Gly 


Met 


Ala 


Asp 


Gly 


Leu 


Phe 


Leu 


Leu 








275 










280 










285 










Arg 


Gin 


Cys 


Leu 


Arg 


Ser 


Leu 


Gly 


Gly 


Tyr 


Val 


Leu 


Ser 


Leu 


Val 


His 






290 










295 










300 












Asp 


Val 


Arg 


Phe 


His 


His 


Phe 


Pro 


He 


Glu 


Arg 


Gin 


Leu 


Asn 


Gly 


Thr 


25 


305 










310 










315 










320 




Tyr 


Ala 


lie 


Ala 


Gly 


Gly 


Lys 


Ala 


His 


Cys 


Gly 


Pro 


Ala 


Glu 


Leu 


Cys 












325 










330 










335 






Glu 


Phe 


Tyr 


Ser 


Arg 


Asp 


Pro 


Asp 


Gly 


Leu 


Pro 


Cys 


Asn 


Leu 


Arg 


Lys 










340 










345 










350 






30 


Pro 


Cys 


Asn 


Arg 


Pro 


Ser 


Gly 


Leu 


Glu 


Pro 


Gin 


Pro 


Gly 


Val 


Phe 


Asp 








355 










360 










365 










Cys 


Leu 


Arg 


Asp 


Ala 


Met 


Val 


Arg 


Asp 


Tyr 


Val 


Arg 


Gin 


Thr 


Trp 


Lys 






370 










375 










380 












Leu 


Glu 


Gly 


Glu 


Ala 


Leu 


Glu 


Gin 


Ala 


He 


He 


Ser 


Gin 


Ala 


Pro 


Gin 


35 


385 










390 










395 










400 




Val 


Glu 


Lys 


Leu 


He 


Ala 


Thr 


Thr 


Ala 


His 


Glu 


Arg 


Met 


Pro 


Trp 


Tyr 












405 










410 










415 






His 


Ser 


Ser 


Leu 


Thr 


Arg 


Glu 


Glu 


Ala 


Glu 


Arg 


Lys 


Leu 


Tyr 


Ser 


Gly 










420 










425 










430 






40 


Ala 


Gin 


Thr 


Asp 


Gly Lys 


Phe 


Leu 


Leu 


Arg 


Pro 


Arg 


Lys 


Glu 


Gin 


Gly 








435 










440 










445 










Thr 


Tyr 


Ala 


Leu 


Ser 


Leu 


He 


Tyr 


Gly 


Lys 


Thr 


Val 


Tyr 


His 


Tyr 


Leu 






450 










455 










460 












lie 


Ser 


Gin 


Asp 


Lys 


Ala 


Gly 


Lys 


Tyr 


Cys 


He 


Pro 


Glu 


Gly 


Thr 


Lys 


45 


465 










470 










475 










480 




Phe 


Asp 


Thr 


Leu 


Trp 


Gin 


Leu 


Val 


Glu 


Tyr 


Leu 


Lys 


Leu 


Lys 


Ala 


Asp 












485 










490 










495 






Gly 


Leu 


He 


Tyr 


Cys 


Leu 


Lys 


Glu 


Ala 


Cys 


Pro 


Asn 


Ser 


Ser 


Ala 


Ser 










500 










505 










510 






50 


Asn 


Ala 


Ser 


Gly 


Ala 


Ala 


Ala 


Pro 


Thr 


Leu 


Pro 


Ala 


His 


Pro 


Ser 


Thr 








515 










520 










525 










Leu 


Thr 


His 


Pro 


Gin 


Arg 


Arg 


He 


Asp 


Thr 


Leu 


Asn 


Ser 


Asp 


Gly 


Tyr 






530 










535 










540 












Thr 


Pro 


Glu 


Pro 


Ala 


Arg 


He 


Thr 


Ser 


Pro 


Asp 


Lys 


Pro 


Arg 


Pro 


Met 


55 


545 










550 










555 










560 




Pro 


Met 


Asp 


Thr 


Ser 


Val 


Tyr 


Glu 


Ser 


Pro 


Tyr 


Ser 


Asp 


Pro 


Glu 


Glu 
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570 q^c- 

Leu Lys A.p Lys Lys Leu Phe Leu Lys Arg Asp Asn Leu Leu "e Ala 



585 



ASP He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val Arg Gin Gly Val 

600 fiftR 
Tyr Arg Met Arg Lys Lys Gla He Asp Val Ala He Lys Val Leu Lys 

620 

Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met Arg Glu Ala Gin 



10 He Met His Gin Leu Asp Asn Pro Tyr lie "l Arg Leu He Gly v^l" 

cys Gin Ala Glu Ala Leu Met Leu Val M^t Glu Met Ala Gly G^y Gly 

6 65 



Pro Leu His Lys Phe Leu Val Gly lys Arg Glu Glu He III Val Ser 



675 



680 



Asn val Ala Glu Leu Leu His Gin Val Ser Met Gly Met Lys Tyr Leu 

700 

Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 

715 

Leu val Asn Arg His .yr Ala Lys He Ser Asp Phe Gly Leu Ser lys 

Ala Leu Gly Ala Asp Asp Ser Tyr ^r III Ala Arg Ser Ala Gly Lys 

Trp Pro Leu Lys Trp Tyr Ala Pro G^u Cys He Asn Phe Irg Lys Phe 

760 765 
ser ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr Met Trp Glu Ala 

"^"^^ 780 
Leu ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys Gly Pro Glu Val 

Met Ala Phe He Glu Gin Gly Lys Arg Met g!u Cys Pro Pro Glu ^^s 

^ 810 QIC 

Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp He Tyr "s Trp 

^ 825 830 

Glu ASP Arg Pro Asp Phe Leu Thr Val Glu Gin Arg Met Arg Ala Cys 
Tyr Tyr Ser Leu Ala Ser Lys Z Glu Gly Pro Pro Gl'y Ser Thr Gin 



Lys Ala Glu Ala Ala Cys Ala 



860 



865 



870 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2595 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



198 
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ATG CCA GAC CCC GCG GCG CAC CTG CCC TTC TTC TAC GGC AGC ATC TCG 4 8 

Met Pro Asp Pro Ala Ala His Leu Pro Phe Phe Tyr Gly Ser lie Ser 
15 10 15 

5 

CGT GCC GAG GCG GAG GAG CAC CTG AAG CTG GCG GGC ATG GCG GAC GGG 96 
Arg Ala Glu Ala Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly 
20 25 30 

10 CTC TTC CTG CTG CGC CAG TGC CTG CGC TCG CTG GGC GGC TAT GTG CTG 144 
Leu Phe Leu Leu Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu 
35 40 45 

TCG CTC GTG CAC GAT GTG CGC TTC CAC CAC TTT CCC ATC GAG CGC CAG 192 
15 Ser Leu Val His Asp Val Arg Phe His His Phe Pro lie Glu Arg Gin 
50 55 60 

CTC AAC GGC ACC TAC GCC ATT GCC GGC GGC AAA GCG CAC TGT GGA CCG 24 0 

Leu Asn Gly Thr Tyr Ala lie Ala Gly Gly Lys Ala His Cys Gly Pro 
20 65 70 75 80 

GCA GAG CTC TGC GAG TTC TAC TCG CGC GAC CCC GAC GGG CTG CCC TGC 288 

Ala Glu Leu Cys Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys 
85 90 95 

25 

AAC CTG CGC AAG CCG TGC AAC CGG CCG TCG GGC CTC GAG CCG CAG CCG 336 

Asn Leu Arg Lys Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro 

100 105 110 

30 GGG GTC TTC GAC TGC CTG CGA GAC GCC ATG GTG CGT GAC TAC GTG CGC 384 
Gly Val Phe Asp Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arg 
115 120 125 

CAG ACG TGG AAG CTG GAG GGC GAG GCC CTG GAG CAG GCC ATC ATC AGC 432 
35 Gin Thr Trp Lys Leu Glu Gly Glu Ala Leu Glu Gin Ala lie lie Ser 
130 135 140 

CAG GCC CCG CAG GTG GAG AAG CTC ATT GCT ACG ACG GCC CAC GAG CGG 480 
Gin Ala Pro Gin Val Glu Lys Leu lie Ala Thr Thr Ala His Glu Arg 
40 145 150 155 160 

ATG CCC TGG TAC CAC AGC AGC CTG ACG CGT GAG GAG GCC GAG CGC AAA 528 

Met Pro Trp Tyr His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys 
165 170 175 

45 

CTT TAC TCT GGG GCG CAG ACC GAC GGC AAG TTC CTG CTG AGG CCG CGG 576 

Leu Tyr Ser Gly Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg 

180 185 190 

50 AAG GAG CAG GGC ACA TAC GCC CTG TCC CTC ATC TAT GGG AAG ACG GTG 624 
Lys Glu Gin Gly Thr Tyr Ala Leu Ser Leu lie Tyr Gly Lys Thr Val 
195 200 205 

TAC CAC TAC CTC ATC AGC CAA GAC AAG GCG GGC AAG TAC TGC ATT CCC 672 
55 Tyr His Tyr Leu lie Ser Gin Asp Lys Ala Gly Lys Tyr Cys lie Pro 
210 215 220 
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200 



"° 235 

CTG AAG GCG GAC GGG CTC ATC TAC Tfr f-iv 

, ^„ - - 



255 



270 

S3 s s s ii: s SI s ir. s s ?s s 



285 



s s ?S S ^ ^ s s ?s s - 

300 



s s ?s s s z z ™? s 

315 

335 

CTC CTC ATA GCT GAC ATT GAA CTT GGG TGP rrn i^^r^ 

..u ^„ „. ^. ^ ^„ ^ »oc ^ ™ 

^"^^ 350 

SI i?^ itt zt r r "^^^ ^^'^ °- °- 

355 ^ "-^^ ^^'^ Asp Val Ala He 

365 

g'Jh s ?s r r °- -° 

3,0 Thr Glu Lys Ala Asp Thr Glu Glu Met Met 

^'^ 380 

Z G?U ilt CCC TAC ATC GTG CGG 

385 39J Asp Asn Pro Tyr He Val Arg 

S ^ S° I?; S r 5- 1° 

Anl ^^"^ Wet Leu val Met Glu Met 



410 



415 



A^I G?y G?y G^y Pro S3' T 

420 f-! Arg Glu Glu 



"5 430 



?s „^ ™ s 



445 



720 



768 



816 



864 



912 



960 



1006 



1056 



1104 



1152 



1200 



1248 



1296 



1344 
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ATG AAG TAG CTG GAG GAG AAG AAC TTT GTG CAC CGT GAG CTG GCG GCC 1392 

Met Lys Tyr Leu Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala 
450 455 460 

5 

CGC AAC GTC CTG CTG GTT AAC CGG CAC TAG GCC AAG ATC AGC GAC TTT 1440 

Arg Asn Val Leu Leu Val Asn Arg His Tyr Ala Lys lie Ser Asp Phe 

465 470 475 480 

10 GGC CTG TCC AAA GCA CTG GGT GCC GAC GAC AGC TAC TAC ACT GCC CGC 14 88 

Gly Leu Ser Lys Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg 
485 490 495 

TCA GCA GGG AAG TGG CCG CTC AAG TGG TAC GCA CCC GAA TGC ATC AAC 1536 
15 Ser Ala Gly Lys Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys lie Asn 
500 505 510 

TTC CGC AAG TTC TCC AGC CGC AGC GAT GTC TGG AGC TAT GGG GTC ACC 15 84 

Phe Arg Lys Phe Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr 
20 515 520 525 

ATG TGG GAG GCC TTG TCC TAC GGC CAG AAG CCC TAC AAG AAG ATG AAA 1632 

Met Trp Glu Ala Leu Ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys 
530 535 540 

25 

GGG CCG GAG GTC ATG GCC TTC ATC GAG CAG GGC AAG CGG ATG GAG TGC 1680 

Gly Pro Glu Val Met Ala Phe lie Glu Gin Gly Lys Arg Met Glu Cys 

545 550 555 560 

30 CCA CCA GAG TGT CCA CCC GAA CTG TAC GCA CTC ATG AGT GAC TGC TGG 1728 
Pro Pro Glu Cys Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp 
565 570 575 

ATC TAC AAG TGG GAG GAT CGC CCC GAC TTC CTG ACC GTG GAG CAG CGC 1776 
35 lie Tyr Lys Trp Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg 
580 585 590 

ATG CGA GCC TGT TAC TAC AGC CTG GCC AGC AAG GTG GAA GGG CCC CCA 1824 
Met Arg Ala Cys Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro 
40 595 600 605 

GGC AGC ACA CAG AAG GCT GAG GCT GCC TGT GCC TGG GAT CCA CCG GTC 1872 
Gly Ser Thr Gin Lys Ala Glu Ala Ala Cys Ala Trp Asp Pro Pro Val 
610 615 620 

45 

GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC 1920 
Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
625 630 635 640 

50 ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG 196 8 

lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val 
645 650 655 

TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG 2 016 

55 Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
660 665 670 
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III ne ?hr rTr OIv T x ''^^ """^ <'-° 

6^5 ^ ''^^ Trp Pro Thr Leu Val 



680 



6B5 



S ^ s .-^ p - s s Jv- s 



700 



m!^ T'^C GCC ATG CCC GAA GGC TAG GTC 

Met Lys Gin H.s Asp Phe Phe Lys Ser Ala Met Pro ITy ^r 

715 

CAG GAG CGC ACC ATC TTC TTC AAG GAG GAC GGC AAC TAG AAG ACC CGC 
cm Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn ^r Ss ihr 



730 



735 



tr. ^ ir. s ?s ^ z 1- s= 

745 

s s 2^ ^ z s - - s - - 

760 

GAG TAG AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG 
Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala JJs Sn 



780 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



^ z ir.^t^^z^^. s r "'^ ^ 

785 ^ Arg Has Asn He Glu Asp 

795 800 

ser vTx Sn ^^"^ ^^^'^ "'^^ CCC ATC GGC 

Gly ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 

8°= 810 815 

ASP S Pro vTl r° r° "^"^^ ^= ^ CAG TCC 

ASP Gly Pro val Leu Leu Pro Asp Asn His Tyr Leu ser Thr Gin Ser 

"° 825 830 

GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG 
Ala Leu ser Lys Asp Pro Asn Glu Lys Arg Asp His Met vll Su Su 

840 645 

Oil 111 T.' CTG TAC 

Glu Phe val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr 

AAG TAA 

Lys 2598 
865 



(2) INFORMATION FOR SEQ ID NO: 111: 



2400 



2448 



2496 



2544 



2592 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 





Met 


Pro 


Asp 


Pro 


Ala 


Ala 


His 


Leu 


Pro 


Phe 


Phe 


Tyr 


Gly 


Ser 


He 


Ser 




1 








5 










10 










15 






Arg 


Ala 


Glu 


Ala 


Glu 


Glu 


His 


Leu 


Lys 


Leu 


Ala 


Gly 


Met 


Ala 


Asp 


Gly 


15 








20 










25 










30 








Leu 


Phe 


Leu 


Leu 


Arg 


Gin 


Cys 


Leu 


Arg 


Ser 


Leu 


Gly 


Gly 


Tyr 


Val 


Leu 








35 










40 










45 










Ser 


Leu 


Val 


His 


Asp 


Val 


Arg 


Phe 


His 


His 


Phe 


Pro 


He 


Glu 


Arg 


Gin 






50 










55 










60 










20 


Leu 


Asn 


Gly 


Thr 


Tyr 


Ala 


He 


Ala 


Gly 


Gly Lys 


Ala 


His 


Cys 


Gly 


Pro 




65 










70 










75 










80 




Ala 


Glu 


Leu 


Cys 


Glu 


Phe 


Tyr 


Ser 


Arg 


Asp 


Pro 


Asp 


Gly Leu 


Pro 


Cys 












85 










90 










95 






Asn 


Leu 


Arg 


Lys 


Pro 


Cys 


Asn 


Arg 


Pro 


Ser Gly 


Leu 


Glu 


Pro 


Gin 


Pro 


25 








100 










105 










110 








Gly 


val 


Phe 


Asp 


Cys 


Leu 


Arg 


Asp 


Ala 


Met 


Val 


Arg 


Asp 


Tyr 


Val 


Arg 








115 










120 










125 










Gin 


Thr 


Trp 


Lys 


Leu 


Glu 


Gly 


Glu 


Ala 


Leu 


Glu 


Gin 


Ala 


He 


He 


Ser 






130 










135 










140 










30 


Gin 


Ala 


Pro 


Gin 


Val 


Glu 


Lys 


Leu 


He 


Ala 


Thr 


Thr 


Ala 


His 


Glu 


Arg 




145 










150 










155 










160 




Met 


Pro 


Trp 


Tyr 


His 


Ser 


Ser 


Leu 


Thr 


Arg 


Glu 


Glu 


Ala 


Glu 


Arg 


Lys 












165 










170 










175 






Leu 


Tyr 


Ser 


Gly 


Ala 


Gin 


Thr 


Asp 


Gly 


Lys 


Phe 


Leu 


Leu Arg 


Pro 


Arg 


35 








180 










185 










190 








Lys 


Glu 


Gin 


Gly 


Thr 


Tyr 


Ala 


Leu 


Ser 


Leu 


He 


Tyr 


Gly Lys 


Thr 


val 








195 










200 










205 










Tyr 


His 


Tyr 


Leu 


lie 


Ser 


Gin 


Asp 


Lys 


Ala 


Gly 


Lys 


Tyr 


Cys 


He 


Pro 






210 










215 










220 










40 


Glu 


Gly 


Thr 


Lys 


Phe 


Asp 


Thr 


Leu 


Trp 


Gin 


Leu 


Val 


Glu 


Tyr 


Leu 


Lys 




225 










230 










235 










240 




Leu 


Lys 


Ala 


Asp 


Gly 


Leu 


He 


Tyr 


Cys 


Leu 


Lys 


Glu 


Ala 


Cys 


Pro 


Asn 












245 










250 










255 






Ser 


Ser 


Ala 


Ser 


Asn 


Ala 


Ser 


Gly Ala 


Ala 


Ala 


Pro 


Thr 


Leu 


Pro 


Ala 


45 








260 










265 










270 








His 


Pro 


Ser 


Thr 


Leu 


Thr 


His 


Pro 


Gin 


Arg 


Arg 


He 


Asp 


Thr 


Leu 


Asn 








275 










280 










285 










Ser 


Asp 


Gly 


Tyr 


Thr 


Pro 


Glu 


Pro 


Ala 


Arg 


He 


Thr 


Ser 


Pro 


Asp 


Lys 






290 










295 










300 










50 


Pro 


Arg 


Pro 


Met 


Pro. 


Met 


Asp 


Thr 


Ser 


Val 


Tyr 


Glu 


Ser 


Pro 


Tyr 


Ser 




305 










310 










315 










320 




Asp 


Pro 


Glu 


Glu 


Leu 


Lys 


Asp 


Lys 


Lys 


Leu 


Phe 


Leu 


Lys 


Arg 


Asp 


Asn 












325 










330 










335 






Leu 


Leu 


He 


Ala 


Asp 


He 


Glu 


Leu 


Gly 


Cys 


Gly 


Asn 


Phe 


Gly 


Ser 


Val 


55 








340 










345 










350 








Arg 


Gin 


Gly 


Val 


Tyr 


Arg 


Met 


Arg 


Lys 


Lys 


Gin 


He 


Asp 


Val 


Ala 


He 
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365 

Lys Val Leu Lys Gin Glv Thr nli, tx,c tm * , 

370 I!!^ ^ "^^^ Glu Glu Met Met 

375 "inn 
Arg Olu Ala Gin Xle Met His Oln Leu Asp Asn .rl ryr Xle Val .r. 



Leu lie Oly Val Cys Gin Ala Glu Ala Leu III Leu Val Met CZu 21 



400 



''OS 410 



Ala Gly Oly Gly Pro Leu His Lys Phe Leu Val oly Lys Arg X Glu 



.0 lie pro val Ser Asn Val Ala Glu III .eu His Gin Val ^er Met Gly 
Met Lys Tyr Leu Glu Glu Lys lln Phe Val His Arg Leu Ala Ala 

Val 
470 



460 



Arg Asn Val Leu Leu Val Asn Arg His ^r Ala Lys He Ser Asp Phe 
Gly Leu ser Lys Ala Leu cly Ala Asp Asp III Tyr ^r Thr Ala Irg 



490 

ser Ala Gly Lys Trp Pro Leu Lys Trp .yr Ala Pro Glu Cys ^l" Asn 



0 Phe Arg Lys Phe Ser Ser Arg Ser lit Val Trp Ser Tyr Gl" Val Thr 
Met Trp Glu Ala Leu Ser Tyr lly Gin Lys Pro Tyr l" Lys Met Lys 



"5 



Oly Pro Glu val Met Ala Phe Xle Glu Gin Gly Lys Arg Met Glu Cys 

Pro Pro Glu Cys Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys t" 

lie Tyr Lys Trp Glu Asp Arg Pro Asp III Leu Thr Val Glu Gl'n Arg 

Met Arg Ala Cys Tyr Tyr Ser Leu Al" Ser Lys Val Glu Gly Pro Pro 

Gly ser Thr Gin Lys Ala Glu III Ala Cys Ala Trp As" Pro Pro Val 

Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Sr Gly Val Val Pro 

lie Leu val Glu Leu Asp Gly Asp Val Asn Oly His Lys Phe Ser V.I 

650 

ser Gly Glu Gly Glu Cly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
Phe lie cys Thr Thr Gly Lys Leu Sro Val Pro Trp Pro Ihr Leu Val 
Thr Thr Leu Thr ryr Gly Val HI cys Phe Ser Arg "r Pro Asp His 
Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gl'y Asn Tyr Lys Thr Irg 
Ala Glu val Lys Phe Glu Gly Asp Thr Jll val Asn Arg xle Glu Leu 
X.ys Gly lie ASP Phe Lys Glu Asp g" Asn Xle Leu Gly HI .ys Leu 
Glu Tyr Asn Tyr Asn Ser His III Val ^ xle Met 111 Asp Lys Gin 
Lys Asn Gly Xle Lys Val Asn Phe Lys Xle Arg Asn xle Glu Asp 

Gly ser Val Gin Leu Ala Asp His Tyr Gin Gl^ Asn Thr Pro Xle oJy 



204 



SUBSTITUTE SHEET (RULE 26) 



10 
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55 



WO 98/45704 PCT/DK98/001 45 

205 

805 810 815 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 

820 825 830 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 

835 840 645 

Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr 
850 855 860 

Lys 
865 

(2) INFORMATION FOR SEQ ID NO: 112: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1635 base pairs 

15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
20 (ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION : 1 . . . 1632 
<D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



ATG GAG AAC TTC CAA AAG GTG GAA AAG ATC GGA GAG GGC ACG TAC GGA 4 8 

Met Glu Asn Phe Gin Lys Val Glu Lys lie Gly Glu Gly Thr Tyr Gly 
30 1 5 10 15 

GTT GTG TAC AAA GCC AGA AAC AAG TTG ACG GGA GAG GTG GTG GCG CTT 96 
Val Val Tyr Lys Ala Arg Asn Lys Leu Thr Gly Glu Val Val Ala Leu 
20 25 30 

35 

AAG AAA ATC CGC CTG GAC ACT GAG ACT GAG GGT GTG CCC AGT ACT GCC 144 
Lys Lys lie Arg Leu Asp Thr Glu Thr Glu Gly Val Pro Ser Thr Ala 
35 40 45 

40 ATC CGA GAG ATC TCT CTG CTT AAG GAG CTT AAC CAT CCT AAT ATT GTC 192 
lie Arg Glu lie Ser Leu Leu Lys Glu Leu Asn His Pro Asn lie Val 
50 55 60 

AAG CTG CTG GAT GTC ATT CAC ACA GAA AAT AAA CTC TAC CTG GTT TTT 24 0 

45 Lys Leu Leu Asp Val lie His Thr Glu Asn Lys Leu Tyr Leu Val Phe 
65 70 75 80 

GAA TTT CTG CAC CAA GAT CTC AAG AAA TTC ATG GAT GCC TCT GCT CTC 288 
Glu Phe Leu His Gin Asp Leu Lys Lys Phe Met Asp Ala Ser Ala Leu 
50 85 90 95 

ACT GGC ATT CCT CTT CCC CTC ATC AAG AGC TAT CTG TTC CAG CTG CTC 336 
Thr Gly lie Pro Leu Pro Leu lie Lys Ser Tyr Leu Phe Gin Leu Leu 
100 105 110 



CAG GGC CTA GCT TTC TGC CAT TCT CAT CGG GTC CTC CAC CGA GAC CTT 3 84 
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Gin Gly Leu Ala Phe Cys His Ser His Arg Val Leu His Arg Asp Leu 
11= 120 



125 



AAA CCT CAG AAT CTG CTT ATT AAC ACA C5AG GGG GCC ATC AAG CTA GCA 
I^ys Pro Gin Aon Leu Leu He Asn Thr Glu Gly Ala zll ™ a^^ 

"° "5 

GAC TTT GGA CTA GCC AGA GCT TTT GGA GTC CCT GTT CGT ACT TAC ACr 
ASP Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Val Arg T^r rZ 

" "° 155 

Hil tin Zl f ? T^" r° '■''^ ^''^ ^ ^ <^ 

His Glu val Val Thr Leu Trp Tyr Arg Ala Pro Glu He Leu Leu Gly 

165 170 175 

TCG AAA TAT TAT TCC ACA GCT GTG GAC ATC TCG AGC CTG GGC TGC ATC 
ser Lys Tyr Tyr Ser Thr Ala Val Asp He Trp Ser Leu Gly Cys lie 
180 IBS 190 



ctu ^^-^ T^-r 

Phe Ala Glu Met Val Thr Arg Arg Ala Leu Phe Pro Gly Asp Ser Glu 

195 200 205 

iZ tin If '^'^^ ''^^ ^'^^ CCA GAT GAG 

He Asp Gin Leu Phe Arg He Phe Arg Thr Leu Gly Thr Pro Asp Glu 

215 220 

GTG GTG TGG CCA GGA GTT ACT TCT ATG CCT GAT TAC AAG CCA AGT TTC 
val val Trp Pro Gly Val Thr Ser Met Pro Asp Tyr Lye Pro Ser pJe 
230 235 240 

Prf w° l"^ """^ """^ ^^"^ ^ GTA CCT CCC CTG GAT 

Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys Val Val Pro Pro Leu Asp 

250 255 

GAA GAT GGA CGG AGC TTG TTA TCG CAA ATG CTG CAC TAC GAC CCT AAC 
Glu ASP Gly Arg Ser Leu Leu Ser Gin Met Leu His Tyr 2p P^ J^n 
260 265 270 

AAG CGG ATT TCG GCC AAG GCA GCC CTG GCT CAC CCT TTC TTC CAG GAT 
Lys Arg He Ser Ala Lys Ala Ala Leu Ala His Pro Phe Ph^ G^n A^p 
275 280 2B5 

Th^ r° f ''''^ '''''' ''^^ '^^C CCA CCG GTC GCC 

Val Thr Lys Pro Val Pro His Leu Arg Leu Trp Asp Pro Pro Val Ala 
290 295 300 

ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC 
Thr Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Pro He 

^ 3" 315 320 

CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC 
Leu val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 

330 335 

GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 
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Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
340 345 350 

ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC 1104 
5 lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
355 360 365 

ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG 1152 
Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
10 370 375 380 

AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG 1200 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
385 390 395 400 

15 

GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC 124 8 

Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
405 410 415 

20 GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG 1296 
Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 
420 425 430 

GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG 1344 
25 Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu 
435 440 445 

TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG 1392 
Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 
30 450 455 460 

AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC 144 0 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 
465 470 475 • 480 

35 

AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC 14 8 8 

Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
485 490 495 

40 GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC 1536 
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
500 505 510 

CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG 1584 
45 Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
515 520 525 

TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG T 1633 
Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
50 530 535 540 

AA 1635 
(2) INFORMATION FOR SEQ ID NO: 113: 

55 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQtJENCE DESCRIPTION: SEQ ID N0:113: 

Met Glu Asn Phe Gin Lys Val Glu Lys lie Gly Glu Gly Thr Tyr Gly 

Val val Tyr Lys Ala Arg Asn Lys Leu ^hr Gly Glu Val Val lla Leu 

20 25 30 

Lys Lys lie Arg Leu Asp Thr Glu Thr Glu Gly Val Pro Ser Thr Ala 

He Arg Glu He Ser Leu Leu Lys Glu Leu Asn His llo Asn He Val 

55 60 
Lys Leu Leu Asp Val lie His Thr Glu Asn Lys Leu Tyr Leu Val Phe 

Glu Phe Leu His Gin Asp Leu Lys Lys Phe Met Asp Ala Ser Ala llu 

90 55 
Thr Gly He Pro Leu Pro Leu He Lys Ser Tyr Leu Phe Gin Leu Leu 

105 

Gin Gly Leu Ala Phe Cys His Ser His Arg Val Leu His Arg Asp Leu 

120 125 
Lys Pro Gin Asn Leu Leu He Asn Thr Glu Gly Ala He Lys Leu Ala 

■^•^^ 135 



140 

Asp Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Val Arg Thr Tyr Thr 

-'-50 155 
His Glu val val Thr Leu Trp Tyr Arg Ala Pro Glu He Leu Leu G^y 



165 170 



n » J- / \J 175 

ser Lys Tyr Tyr Ser Thr Ala Val Asp He Trp Ser Leu Gly Cys He 

185 190 
Phe Ala Glu Met Val Thr Arg Arg Ala Leu Phe Pro Gly Asp Ser Glu 



wvr 205 

He Asp Gin Leu Phe Arg He Phe Arg Thr Leu Gly Thr Pro Asp Glu 

val val Trp Pro Gly Val Thr Ser Met Pro Asp Jyr Lys Pro Ser Phe 
230 235 240 



Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys Val Val Pro Pro Leu 12 

250 255 
Glu ASP Gly Arg Ser Leu Leu Ser Gin Met Leu His Tyr Asp Pro Asn 

2 65 2 
I.ys Arg He Ser Ala Lys Ala Ala Leu Ala His Pro Phe Phe Gin Asp 

28 0 

val Thr Lys Pro Val Pro His Leu Arg Leu Trp Asp Pro Pro Val Ala 

Thr Met val Ser Lys Gly Glu Glu Leu Phe Thr G^y Val Val Pro He 

315 320 



Leu val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val S« 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Uys Phe 

He cys Thr Thr Gly Lys Leu Pro v" Pro Trp Pro Thr Jeu Val Thr 
355 360 
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40 



WO 98/45704 PCT/DK98/00145 

209 





Thr 


Leu 


Thr 


Tyr 


Gly Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 






370 










375 










380 












Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 




385 










390 










395 










400 


5 


Glu 


Arg 


Thr 


He 


Phe 
405 


Phe 


Lys 


Asp 


Asp 


Gly 
410 


Asn 


Tyr 


Lys 


Thr 


Arg 
415 


Ala 




Glu 


Val 


Lys 


Phe 
420 


Glu 


Gly 


Asp 


Thr 


Leu 
425 


Val 


Asn 


Arg 


He 


Glu 
430 


Leu 


Lys 




Gly 


He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly Asn 


He 


Leu 


Gly His 


Lys 


Leu 


Glu 


10 






435 










440 










445 










Tyr 


Asn 
450 


Tyr 


Asn 


Ser 


His 


Asn 
455 


Val 


Tyr 


He 


Met 


Ala 
460 


Asp 


Lys 


Gin 


Lys 




Asn 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 




465 










470 










475 










4B0 


15 


Ser 


Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly Asp 












485 










490 










495 






Gly 


Pro 


Val 


Leu 
500 


Leu 


Pro 


Asp 


Asn 


His 
505 


Tyr 


Leu 


Ser 


Thr 


Gin 
510 


Ser 


Ala 




Leu 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


20 






515 










520 










525 










Phe 


Val 
530 


Thr 


Ala 


Ala 


Gly 


He 
535 


Thr 


Leu 


Gly 


Met 


Asp 
540 


Glu 


Leu 


Tyr 


Lys 



(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
< ix ) FEATURE : 

35 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1632 
(D) OTHER INFORMATION: 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

45 GTC GAG CTG GAC GGC GAC GTA 7U\C GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

50 Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 50 55 60 
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210 



70 7- 



CAG CAC GAC TTC TTC AAG TCC GCC ATO CCC GAA GGC TAC GTC CAG GAG 
Gin Hxs ASP Phe Phe Lys Ser Ala Met Pro Glu Gly vl^ G^n 

85 — 



90 



95 



S rnr lit f ° -^^^ GCC GAG 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

"° 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Su JJs Gly 



125 



ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
He ASP Phe Lys Glu Asp Gly Asn He Leu Gly His Lys lIu G^u 



135 140 



^^n ?Jr Ser S^s v"? T ^^'^ ^° 

^^n Tyr Asn Ser Hxs Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

155 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 
Gly He Lys Val Asn Phe Lys He Arg His Asn He ail Asp Oly ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 
val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro ill Gly A^p G^y 



185 



190 



Pro lIu f '^'^ ""^^ ™^ "° '^^^ GCC CTG 
Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

ser ^ A=f p'^'^ f ^ ATG GTC CTG CTG GAG TTC 

ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

220 

vTl aS ^ iT '^^^ ™^ TCC 

225 ^ '''' "^"^ l-eu Tyr Lys Ser 



235 



240 



GGA CTC AGA TCT CGA GCC ATG GAG AAC TTC CAA AAG GTG GAA AAG ATC 
Gly Leu Arg ser Arg Ala Met Glu Asn Phe Gin Lys Val G^J He 



245 pen 

250 255 



2 o 0 « — 



265 



270 



gS gJu Sa f"^ 

Gly Glu val val Ala Leu Lys Lys He Arg Leu Asp Thr Glu Thr Glu 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



280 285 



210 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 PCT/DK98/00145 

211 

GGT GTG CCC AGT ACT GCC ATC CGA GAG ATC TCT CTG CTT AAG GAG CTT 912 
Gly Val Pro Ser Thr Ala lie Arg Glu lie Ser Leu Leu Lys Glu Leu 
290 295 300 

5 AAC CAT CCT AAT ATT GTC AAG CTG CTG GAT GTC ATT CAC ACA GAA AAT 960 

Asn His Pro Asn lie Val Lys Leu Leu Asp Val lie His Thr Glu Asn 
305 310 315 320 

AAA CTC TAC CTG GTT TTT GAA TTT CTG CAC CAA GAT CTC AAG AAA TTC 1008 
10 Lys Leu Tyr Leu Val Phe Glu Phe Leu His Gin Asp Leu Lys Lys Phe 

325 330 335 

ATG GAT GCC TCT GCT CTC ACT GGC ATT CCT CTT CCC CTC ATC AAG AGC 1056 
Met Asp Ala Ser Ala Leu Thr Gly lie Pro Leu Pro Leu lie Lys Ser 
15 340 345 350 

TAT CTG TTC CAG CTG CTC CAG GGC CTA GCT TTC TGC CAT TCT CAT COG 1104 

Tyr Leu Phe Gin Leu Leu Gin Gly Leu Ala Phe Cys His Ser His Arg 
355 360 365 

20 

GTC CTC CAC CGA GAC CTT AAA CCT CAG AAT CTG CTT ATT AAC ACA GAG 1152 

Val Leu His Arg Asp Leu Lys Pro Gin Asn Leu Leu lie Asn Thr Glu 
370 375 380 

25 GGG GCC ATC AAG CTA GCA GAC TTT GGA CTA GCC AGA GCT TTT GGA GTC 1200 
Gly Ala lie Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Val 
385 390 395 400 

CCT GTT CGT ACT TAC ACC CAT GAG GTG GTG ACC CTG TGG TAC CGA GCT 124 8 

30 Pro Val Arg Thr Tyr Thr His Glu Val Val Thr Leu Trp Tyr Arg Ala 

405 410 415 

CCT GAA ATC CTC CTG GGC TCG AAA TAT TAT TCC ACA GCT GTG GAC ATC 12 96 

Pro Glu lie Leu Leu Gly Ser Lys Tyr Tyr Ser Thr Ala Val Asp lie 
35 420 425 430 

TGG AGC CTG GGC TGC ATC TTT GCT GAG ATG GTG ACT CGC CGG GCC CTG 1344 

Trp Ser Leu Gly Cys He Phe Ala Glu Met Val Thr Arg Arg Ala Leu 
435 440 445 

40 

TTC CCT GGA GAT TCT GAG ATT GAC CAG CTC TTC CGG ATC TTT CGG ACT 1392 

Phe Pro Gly Asp Ser Glu He Asp Gin Leu Phe Arg He Phe Arg Thr 

450 455 460 

45 CTG GGG ACC CCA GAT GAG GTG GTG TGG CCA GGA GTT ACT TCT ATG CCT 144 0 

Leu Gly Thr Pro Asp Glu Val Val Trp Pro Gly Val Thr Ser Met Pro 
465 470 475 480 

GAT TAC AAG CCA AGT TTC CCC AAG TGG GCC CGG CAA GAT TTT AGT AAA 1488 
50 Asp Tyr Lys Pro Ser Phe Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys 

485 490 495 

GTT GTA CCT CCC CTG GAT GAA GAT GGA CGG AGC TTG TTA TCG CAA ATG 1536 
Val Val Pro Pro Leu Asp Glu Asp Gly Arg Ser Leu Leu Ser Gin Met 
55 500 505 510 



211 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



212 



CTG CAC TAG GAG CGT AAC AAG COG ATT TGG GCC AAG GCA GCC CTG GCT 

Leu Has Asp Pro Asn Lys Arg Xle Ser Ala Lys S Ma SI """" 



525 



CAC CCT TTC TTC CAG GAT GTG ACC AAG CCA 6TA CCC CAT CTT CGA CTC T 
Hxs Pro Phe Phe Gin Asp Val Thr Lys Pro Val Pro His 12 Arg ^eu 

540 



GA 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

10 15 
val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

Cys Thr Thr Gly Lys Leu Pro V^l Pro Trp Pro Thr ^eu Val Thr Thr 

55 go 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin gL 

90 95 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

105 110 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 125 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

135 140 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

Gly He Lys Val Asn Phe Lys He Arg His III He Glu Asp Gly Ser 

170 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 T Q C 

±03 190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp III Leu Tyr Lys Ser 



Gly Leu Arg Ser Arg Ala Met Glu Asn Phe G^n Lys Val Glu Lys He 
Gly Glu Gly Thr Tyr Gly Val Val Tyr Jys Ala Arg Asn Lys "u Thr 



1635 
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260 265 270 





Gly Glu 


Val 


Val 


Ala 




Lys 




X J. C 


Arg 


Leu 


Asp 


Thr 




Thr 


m 11 

xjXU 








^ / o 










5 R n 
^ o u 










o o c 










Gly Val 


Pro 


Ser 


Thr 


Ala 


He 


ArQ 


Glu 


He 


Ser 


Leu 


Leu 


Lys 


Glu 


Leu 


c 

u 




290 










•c ^ 3 










^ u u 












Asn 


His 


Pro 


Asn 


He 


Val 


Lys 


Le 


Leu 


Asp 


Val 


J. X6 


rix a 


X xix 


Glu 


Asn 




305 




















^ Xo 










"Son 
J Z U 




Lys 


Leu 


Tyr 


Leu 


Val 


Phe 


Glu 


file 


Leu 




f2l n 


Asp 


Leu 


Lys 


Lys 














•a •> c 










330 










335 






Met 


Asp 


Ala 
A J. a. 


Ser 




Leu 


TV* V 


Ljj.y 


xxe 


Pro 


Leu 


Pro 


Leu 


X xe 


Lys 


Ser 










340 










345 










350 








Tyr 


Leu 


Pne 


vj±n. 


Leu 


Leu 


tij.n 


Gly 


Leu 


TV T -1 

Ala 


Phe 


Cys 


His 


Ser 


HxS 


Arg 








3 55 










360 










365 










Val 


Leu 


His 


Arg 


Asp 


Leu 


Lys 


Pro 


Gin 


Asn 


Leu 


Leu 


lie 


Asn 


Thr 


Glu 


15 




370 










375 










380 












Gly Ala 


T 1 ea 

± le 


Lys 


Leu 


Ala 


Asp 


Phe 


\jiy 


Leu 


Aia 


Arg 


Ala 


nil A 

pne 


uiy 


vai 




385 










3 90 










3 95 










400 




Pro 


Val 


Arcf 


Thr 


Tyr 


Thr 


His 


Glu 


Val 


Val 


Thr 


Leu 


Trp 


Tyr 


Arg 


TV 1 

Ala 












405 










410 










415 






Pro 


Glu 


T 1 » 

±±e 


Leu 


Leu 


\3±y 


oer 


Lys 


Tyr 


Tyr 


Ser 


Thr 


Aia 


vai 


Asp 


lie 










420 










425 










43 0 








Trp 


Ser 


Leu 


Gly 


Cys 


i.j.e 


Fne 


A±a 


Glu 


Met 


Val 


Thr 


Arg 


Arg 


Ala 

Aia 


Leu 








435 










440 










445 










Phe 


Pro 


Gly 


Asp 


Ser 


Glu 


He 


Asp 


Gin 


Leu 


Phe 


Arg 


He 


Phe 


Arg 


Thr 


25 




450 










455 










460 












Leu 


Gly 


Thr 


Pro 


Asp 


Glu 


Val 


Val 


Trp 


Pro 


Gly 


Val 


Thr 


Ser 


Met 


Pro 




465 










470 










475 










480 




Asp 


Tyr 


Lys 


Pro 


Ser 


Phe 


Pro 


Lys 


Trp 


Ala 


Arg 


Gin 


Asp 


Phe 


Ser 


Lys 












485 










490 










495 




30 


Val 


val 


Pro 


Pro 


Leu 


Asp 


Glu 


Asp 


Gly 


Arg 


Ser 


Leu 


Leu 


Ser 


Gin 


Met 










500 










505 










510 








Leu 


His 


Tyr 


Asp 


Pro 


Asn 


Lys 


Arg 


He 


Ser 


Ala 


Lys 


Ala 


Ala 


Leu 


Ala 








515 










520 










525 










His 


Pro 


Phe 


Phe 


Gin 


Asp 


Val 


Thr 


Lys 


Pro 


Val 


Pro 


His 


Leu 


Arg 


Leu 


35 




530 










535 










540 











(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: l..,2529 
50 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
55 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 
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GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser vll Ser Oly 



25 



30 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 11. 



40 



Ss TUr ?hr G?v '''''' """^ ^"^^ ACC 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 



55 



60 



CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His H^t JJs 
'° 75 ao 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 
Gin Has ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Va^ G^n G^u 
85 90 95 

S rlr ?Ie ^e 'Z f ^ ^'''^ '^'^C GCC GAG 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

"° 105 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu SJs Tly 



120 



125 



ne "^"^ ^^'^ "° AAG CTG GAG TAC 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 



140 



AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 

155 160 



S He ?Il f ^^"^ ^= ^C °AC GGC AGC 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

vlt X llu A^ ^^"^ "^^^ ^C'^ CC<^ ATC GGC GAC GGC 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro vTl l""" p^'' '^'^ "^^"^ ''^^ ^C'^ CAG TCC GCC CTO 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTO CTG GAG TTC 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 



215 



220 



vTl Jhr Alf 1'^'' of tT"^ """^^ ^« TCC 

Val Thr Ala Ala. Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

235 240 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 
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GGA CTC AGA TCT CGA GAG ATG CTG TCC CGT GGG TGG TTT CAC CGA GAC 768 

Gly Leu Arg Ser Arg Glu Met Leu Ser Arg Gly Trp Phe His Arg Asp 
245 250 255 

5 

CTC AGT GGG CTG GAT GCA GAG ACC CTG CTC AAG GGC CGA GGT GTC CAC 816 

Leu Ser Gly Leu Asp Ala Glu Thr Leu Leu Lys Gly Arg Gly Val His 

260 265 270 

10 GGT AGC TTC CTG GCT CGG CCC AGT CGC AAG AAC CAG GGT GAC TTC TOG 864 

Gly Ser Phe Leu Ala Arg Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser 
275 2B0 285 

CTC TCC GTC AGG GTG GGG GAT CAG GTG ACC CAT ATT CGG ATC CAG AAC 912 

15 Leu Ser Val Arg Val Gly Asp Gin Val Thr His lie Arg lie Gin Asn 
290 295 300 



TCA GGG GAT TTC TAT GAC CTG TAT GGA GGG GAG AAG TTT GCG ACT CTG 960 
Ser Gly Asp Phe Tyr Asp Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu 
20 305 310 315 320 



ACA GAG CTG GTG GAG TAG TAG ACT CAG CAG CAG GGT GTC CTG CAG GAC 1008 

Thr Glu Leu Val Glu Tyr Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp 
325 330 335 

25 

CGC GAC GGC ACC ATC ATC CAC CTC AAG TAG CCG CTG AAC TGC TCC GAT 1056 

Arg Asp Gly Thr lie lie His Leu Lys Tyr Pro Leu Asn Cys Ser Asp 
340 345 350 



30 CCC ACT AGT GAG AGG TGG TAG CAT GGC CAC ATG TCT GGC GGG CAG GCA 1104 
Pro Thr Ser Glu Arg Trp Tyr His Gly His Met Ser Gly Gly Gin Ala 
355 360 365 



GAG ACG CTG CTG CAG GCC AAG GGC GAG CCC TGG ACG TTT CTT GTG CGT 1152 
35 Glu Thr Leu Leu Gin Ala Lys Gly Glu Pro Trp Thr Phe Leu Val Arg 
370 375 380 



GAG AGC CTC AGC CAG CCT GGA GAC TTC GTG CTT TCT GTG CTC AGT GAC 1200 
Glu Ser Leu Ser Gin Pro Gly Asp Phe Val Leu Ser Val Leu Ser Asp 
40 385 390 395 400 



CAG CCC AAG GCT GGC CCA GGC TCC CCG CTC AGG GTC ACC CAC ATC AAG 1248 

Gin Pro Lys Ala Gly Pro Gly Ser Pro Leu Arg Val Thr His lie Lys 

405 410 415 

45 

GTC ATG TGC GAG GGT GGA CGC TAC ACA GTG GGT GGT TTG GAG ACC TTC 1296 

Val Met Cys Glu Gly Gly Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe 

420 425 430 

50 GAC AGC CTC ACG GAC CTG GTA GAG CAT TTC AAG AAG ACG GGG ATT GAG 1344 

Asp Ser Leu Thr Asp Leu Val Glu His Phe Lys Lys Thr Gly lie Glu 

435 440 445 



GAG GCC TCA GGC GCC TTT GTC TAC CTG CGG CAG CCG TAC TAT GCC ACG 1392 
55 Glu Ala Ser Gly Ala Phe Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr 
450 455 460 
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5.1 z s tr. z ^11 ^ -I - ^. 

475 

^. I- s s 2.^ s s s s s:^ s 



490 



495 



505 



510 



GGG CAG CGG CCA GAG AAC AAG GGC AAG AAC CGC TAC AAG AAC ATT CTC 
Gly Gin Arg Pro Glu Asn Lys Gly .ys Asn Arg .yr J^s Su 

520 

s ^ zi ^. iST^zin s s= z - 



540 



S Z ^ Z S Z I- S 

555 S60 

GGC CCT GAT GAG AAC GCT AAG ACC TAC ATC GCC AGC CAG GGC TGT CTG 
Gly Pro ASP Glu Asn Ala Lys Thr Tyr He Ala Ser G^y 

570 

Tu 111 f ^ '^^'^ '^^^ -AG AAC AGC 

Glu Ala Thr Val Asn Asp Phe Trp Gin Met Ala Trp Gin Glu Asn Ser 



585 



590 



600 



605 



vij ^ - - 1- - - 

620 

?S J^n °f T ^AC AAA CTC 

«5 ^^'^ '^^P Thr Glu Tyr Lys Leu 

635 640 

S S S S ^7. S 

650 655 

Hil ^""^ "° °A<= <=«° OTC CCC AGT 

lie Trp Hxs Tyr Gin Tyr Leu Ser Trp Pro Asp His Gly Val Pro sll 



665 



670 



G?u p^^^ Ty G?; Su' r r oT '^^^ -° -° 

y ax i,eu Ser Phe Leu Asp Gin He Asn Gin Arg Gin 



680 



685 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 
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GAA AGT CTG CCT CAC OCA GGG CCC ATC ATC GTG CAC TGC AGC GCC GGC 2112 

Glu Ser Leu Pro His Ala Gly Pro He He Val His Cys Ser Ala Gly 
690 695 700 

5 

ATC GGC CGC AC A GGC ACC ATC ATT GTC ATC GAC ATG CTC ATG GAG AAC 2160 

He Gly Arg .Thr Gly Thr He He Val He Asp Met Leu Met Glu Asn 
705 710 715 720 

10 ATC TCC ACC AAG GGC CTG GAC TGT GAC ATT GAC ATC CAG AAG ACC ATC 2208 
He Ser Thr Lys Gly Leu Asp Cys Asp He Asp He Gin Lys Thr He 
725 730 735 

CAG ATG GTG CGG GCG CAG CGC TCG GGC ATG GTG CAG ACG GAG GCG CAG 22 56 

15 Gin Met Val Arg Ala Gin Arg Ser Gly Met Val Gin Thr Glu Ala Gin 
740 745 750 

TAC AAG TTC ATC TAG GTG GCC ATC GCC CAG TTC ATT GAA ACC ACT AAG 2304 
Tyr Lys Phe He Tyr Val Ala He Ala Gin Phe He Glu Thr Thr Lys 
20 755 760 765 

AAG AAG CTG GAG GTC CTG CAG TCG CAG AAG GGC CAG GAG TCG GAG TAC 2352 
Lys Lys Leu Glu Val Leu Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr 
770 775 780 

25 

GGG AAC ATC ACC TAT CCC CCA GCC ATG AAG T^T GCC CAT GCC AAG GCC 2400 
Gly Asn He Thr Tyr Pro Pro Ala Met Lys Asn Ala His Ala Lys Ala 
785 790 795 800 

30 TCC CGC ACC TCG TCC AAA CAC AAG GAG GAT GTG TAT GAG AAC CTG CAC 2448 
Ser Arg Thr Ser Ser Lys His Lys Glu Asp Val Tyr Glu Asn Leu His 
805 810 815 

ACT AAG AAC AAG AGG GAG GAG AAA GTG AAG AAG CAG CGG TCA GCA GAC 2496 
35 Thr Lys Asn Lys Arg Glu Glu Lys Val Lys Lys Gin Arg Ser Ala Asp 
820 825 830 

AAG GAG AAG AGC AAG GGT TCC CTC AAG AGG AAG TGA 2532 
Lys Glu Lys Ser Lys Gly Ser Leu Lys Arg Lys 
40 835 840 



(2) INFORMATION FOR SEQ ID NO: 117: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



50 



55 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
^ Glu Gly Glu Gly Asp Ala Thr Tyr "y Lys Leu Thr Leu lys Phe He 

Cys Thr Thr Gly Lys Leu Pro 111 Pro Trp Pro Thr ieu Val Thr Thr 

60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
70 75 ^ 

10 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

Arg Thr He Phe Phe Lys Asp Asp Gly IL Tyr Lys Thr Arg Ala Glu 

105 

val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 125 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

20 Gly He Lys Val Asn Jhe Lys He Arg His Asn He Glu Asp Gly III 

170 J^g 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 igo 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 



30 



215 220 



val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

230 235 
Gly Leu Arg Ser Arg Glu Met Leu Ser Arg Gly Trp Phe His Arg A^p 

250 255 
Leu ser Gly Leu Asp Ala Glu Thr Leu Leu Lys Gly Arg Gly Val His 

260 265 270 

Gly ser Phe Leu Ala Arg Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser 

275 280 285 

Leu ser Val Arg Val Gly Asp Gin Val Thr His He Arg He Gin Asn 

295 300 
ser Gly Asp Phe Tyr Asp Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu 

40 Thr Glu Leu Val Glu ^Jr Tyr Thr Gin Gin g" Gly Val Leu Gin A^p 

330 335 
Arg Asp Gly Thr He He His Leu Lys Tyr Pro Leu Asn Cys Ser Asp 

350 



Pro Thr ser Glu Arg Trp Tyr His g" His Met Ser Gly Gl^ Gin Ala 

360 

Glu Thr Leu Leu Gin Ala Lys Gly Glu Pro Trp Thr Phe Leu Val Arg 

375 380 
Glu ser Leu Ser Gin Pro Gly Asp Phe Val Leu Ser Val Leu Ser Asp 
390 395 *' 

50 Gin Pro Lys Ala Gly Pro Gly Ser Pro Leu Arg Val Thr His He Lys 

410 

Val Met Cys Glu Gly Gly Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe 

430 



420 425 



55 ^^"^ '''''' ""^P I-ys Thr Gly He Glu 

Glu Ala ser Gly Ala Phe Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr 
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450 










455 










460 












Arg 


Val 


Asn 


Ala 


Ala 


Asp 


He 


Glu 


Asn 


Arg 


Val 


Leu 


Glu 


Leu 


Asn 


Lys 




465 










470 










475 










480 




Lys 


Gin 


Glu 


Ser 


Glu 


Asp 


Thr 


Ala 


Lys 


Ala 


Gly 


Phe 


Trp 


Glu 


Glu 


Phe 


5 










485 










490 










495 






Glu 


Ser 


Leu 


Gin 
500 


Lys 


Gin 


Glu 


Val 


Lys 
505 


Asn 


Leu 


His 


Gin 


Arg 
510 


Leu 


Glu 




Gly 


Gin 


Arg 


Pro 


Glu 


Asn 


Lys 


Gly 


Lys 


Asn Arg 


Tyr 


Lys 


Asn 


He 


Leu 








515 










520 










525 








10 


Pro 


Phe 


Asp 


His 


Ser 


Arg 


Val 


He 


Leu 


Gin 


Gly Arg Asp 


Ser 


Asn 


He 






530 










535 










540 












Pro 


Gly 


Ser 


Asp 


Tyr 


He 


Asn 


Ala 


Asn 


Tyr 


He 


Lys 


Asn 


Gin 


Leu 


Leu 




545 










550 










555 










560 




Gly 


Pro 


Asp 


Glu 


Asn 


Ala 


Lys 


Thr 


Tyr 


He 


Ala 


Ser 


Gin 


Gly 


Cys 


Leu 


15 










S6S 










570 










575 






Glu 


Ala 


Thr 


Val 
580 


Asn 


Asp 


Phe 


Trp 


Gin 
585 


Met 


Ala 


Trp 


Gin 


Glu 
590 


Asn 


Ser 




Arg 


Val 


He 


Val 


Met 


Thr 


Thr 


Arg 


Glu 


Val 


Glu 


Lys 


Gly Arg 


Asn 


Lys 








595 










600 










605 








20 


Cys 


Val 
610 


Pro 


Tyr 


Trp 


Pro 


Glu 
615 


Val 


Gly 


Met 


Gin 


Arg 
620 


Ala 


Tyr 


Gly 


Pro 




Tyr 


Ser 


Val 


Thr 


Asn 


Cys 


Gly 


Glu 


His 


Asp 


Thr 


Thr 


Glu 


Tyr 


Lys 


Leu 




625 










630 










635 










640 




Arg 


Thr 


Leu 


Gin 


Val 


Ser 


Pro 


Leu 


Asp 


Asn 


Gly Asp 


Leu 


He 


Arg 


Glu 


25 










645 










650 










655 






lie 


Trp 


His 


Tyr 


Gin 


Tyr 


Leu 


Ser 


Trp 


Pro 


Asp 


His 


Gly Val 


Pro 


Ser 










660 










665 










670 








Glu 


Pro 


Gly 
675 


Gly 


Val 


Leu 


Ser 


Phe 
680 


Leu 


Asp 


Gin 


He 


Asn 
685 


Gin 


Arg 


Gin 


30 


Glu 


Ser 


Leu 


Pro 


His 


Ala 


Gly 


Pro 


He 


He 


Val 


His 


Cys 


Ser 


Ala Gly 






690 










695 










700 












lie 


Gly 


Arg 


Thr 


Gly 


Thr 


He 


He 


Val 


He 


Asp 


Met 


Leu 


Met 


Glu 


Asn 




705 










710 










715 










720 




He 


Ser 


Thr 


Lys 


Gly 


Leu 


Asp 


Cys 


Asp 


He 


Asp 


He 


Gin 


Lys 


Thr 


He 


35 










725 










730 










735 






Gin 


Met 


Val 


Arg 
740 


Ala 


Gin 


Arg 


Ser 


Gly 
745 


Met 


Val 


Gin 


Thr 


Glu 
750 


Ala 


Gin 




Tyr 


Lys 


Phe 
755 


He 


Tyr 


val 


Ala 


He 
760 


Ala 


Gin 


Phe 


He 


Glu 
765 


Thr 


Thr 


Lys 


40 


Lys 


Lys 
770 


Leu 


Glu 


Val 


Leu 


Gin 
775 


Ser 


Gin 


Lys 


Gly 


Gin 
780 


Glu 


Ser 


Glu 


Tyr 




Gly 


Asn 


He 


Thr 


Tyr 


Pro 


Pro 


Ala 


Met 


Lys 


Asn 


Ala 


His 


Ala 


Lys 


Ala 




785 










790 










795 










800 




Ser 


Arg 


Thr 


Ser 


Ser 


Lys 


His 


Lys 


Glu 


Asp 


Val 


Tyr 


Glu 


Asn 


Leu 


His 


45 










805 










810 










815 






Thr 


Lys 


Asn 


Lys 
820 


Arg 


Glu 


Glu 


Lys 


Val 
825 


Lys 


Lys 


Gin 


Arg 


Ser 
830 


Ala 


Asp 




Lys 


Glu 


Lys 
835 


Ser 


Lys 


Gly 


Ser 


Leu 
840 


Lys 


Arg 


Lys 













50 

(2) INFORMATION FOR SEQ ID NO:118: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2562 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
<B) LOCATION: 1...25S9 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

ATG CTG TCC CGT GGG TGG TTT CAC CGA GAC CTC AGT GGG CTG GAT GCA 
Met Leu Ser Arg Gly Trp Phe His Arg Asp Leu Ser Gly Leu Asp aS 
^ 10 15 

G?U lI! 7"" """^ ^""^ CTG OCT CGG 

Glu Thr Leu Leu Lys Gly Arg Gly Val His Gly Ser Phe Leu Ala Arg 

^° 25 30 

CCC AGT CGC AAG AAC CAG GOT GAC TTC TCG CTC TCC GTC AGO GTG GGG 
Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser Leu Ser Val Arg Val Gly 
35 40 45 



GAT CAG GTG ACC CAT ATT CGG ATC CAG AAC TCA GGG GAT TTC TAT GAC 
ASP Gin val Thr His He Arg He Gin Asn Ser Gly Asp Phe Sp 



60 



CTG TAT GGA GGG GAG AAG TTT GCG ACT CTG ACA GAG CTG GTG GAG TAC 
Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu Thr Glu Leu Val Glu Sr 
*>3 70 — ' 



75 



TAC ACT CAG CAG CAG GGT GTC CTG CAG GAC CGC GAC GGC ACC ATC ATC 
Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp Arg Asp Gly Jhr fie 111 
85 90 



95 



CAC CTC AAG TAC CCG CTG AAC TGC TCC GAT CCC ACT AGT GAG AGG TGG 
Hxs Leu Lys Tyr Pro Leu Asn Cys Ser Asp Pro Thr Ser Glu Arg Trp 
"° 105 

TAC CAT GGC CAC ATG TCT GGC GGG CAG GCA GAG ACG CTG CTG CAG GCC 
Tyr HXS Gly His Met Ser Gly Gly Gin Ala Glu Thr Leu Leu Oln III 
lis 120 125 

f ""^ CTC AGC CAG CCT 

Lys Gly Glu Pro Trp Thr Phe Leu Val Arg Glu Ser Leu Ser Gin Pro 
"° "5 140 

GGA GAC TTC GTG CTT TCT GTG CTC AGT GAC CAG CCC AAG 6CT GGC CCA 
Gly Asp Phe Val Leu Ser Val Leu Ser Asp Gin Pro Lys Ala Gly Pro 

155 160 

Til ITr Pro Z"" f ° f f '''''' '^^^ ^^'^ -^^C GGA 

Gly Ser Pro Leu Arg Val Thr His He Lys Val Met Cys Glu Gly Gly 

1" 170 175 

CGC TAC ACA GTG GGT GGT TTG GAG ACC TTC GAC AGC CTC ACG GAC CTG 



48 



96 



144 



192 



240 



288 



336 



3 84 



432 



480 



528 



576 
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Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe Asp Ser Leu Thr Asp Leu 
180 185 190 



GTA GAG CAT TTC AAG AAG ACG GGG ATT GAG GAG GCC TCA GGC GCC TTT 624 
5 Val Glu His Phe Lys Lys Thr Gly lie Glu Glu Ala Ser Gly Ala Phe 
195 200 205 



GTC TAG CTG CGG CAG CCG TAG TAT GCC ACG AGG GTG AAT GCG GCT GAC 672 
Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr Arg Val Asn Ala Ala Asp 
10 210 215 220 



ATT GAG AAC 
lie Glu Asn 
225 

15 

ACA GCC AAG 
Thr Ala Lys 



20 GAG GTG AAG 

Glu Val Lys 



AAG GGC AAG 
25 Lys Gly Lys 
275 



CGA GTG TTG GAA CTG 
Arg Val Leu Glu Leu 
230 

GCT GGC TTC TGG GAG 
Ala Gly Phe Trp Glu 
245 

AAC TTG CAC CAG CGT 
Asn Leu His Gin Arg 
260 

AAC CGC TAC AAG AAC 
Asn Arg Tyr Lys Asn 
280 



AAC AAG AAG CAG GAG 
Asn Lys Lys Gin Glu 
235 

GAG TTT GAG AGT TTG 
Glu Phe Glu Ser Leu 
250 

CTG GAA GGG CAG CGG 
Leu Glu Gly Gin Arg 
265 

ATT CTC CCC TTT GAC 
lie Leu Pro Phe Asp 
285 



TCC GAG GAT 720 
Ser Glu Asp 
240 

CAG AAG CAG 768 
Gin Lys Gin 
255 

CCA GAG AAC 816 

Pro Glu Asn 

270 

CAC AGO CGA 864 
His Ser Arg 



GTG ATC CTG CAG GGA CGG GAC AGT AAC ATC CCC GGG TCC GAC TAC ATC 912 
Val He Leu Gin Gly Arg Asp Ser Asn He Pro Gly Ser Asp Tyr He 
30 290 295 300 



AAT GCC AAC TAC ATC AAG AAC CAG CTG CTA GGC CCT GAT GAG AAC GCT 960 
Asn Ala Asn Tyr He Lys Asn Gin Leu Leu Gly Pro Asp Glu Asn Ala 
305 310 315 320 

AAG ACC TAC ATC GCC AGC CAG GGC TGT CTG GAG GCC ACG GTC AAT GAC 1008 
Lys Thr Tyr He Ala Ser Gin Gly Cys Leu Glu Ala Thr Val Asn Asp 
325 330 335 



40 TTC TGG CAG ATG GCG TGG CAG GAG AAC AGC CGT GTC ATC GTC ATG ACC 1056 
Phe Trp Gin Met Ala Trp Gin Glu Asn Ser Arg Val He Val Met Thr 
340 345 350 

ACC CGA GAG GTG GAG AAA GGC CGG AAC AAA TGC GTC CCA TAC TGG CCC 1104 
45 Thr Arg Glu Val Glu Lys Gly Arg Asn Lys Cys Val Pro Tyr Trp Pro 
355 360 365 



GAG GTG GGC ATG CAG CGT GCT TAT GGG CCC TAC TCT GTG ACC AAC TGC 1152 
Glu Val Gly Met Gin Arg Ala Tyr Gly Pro Tyr Ser Val Thr Asn Cys 
50 370 375 380 



GGG GAG CAT GAC ACA ACC GAA TAC AAA CTC CGT ACC TTA CAG GTC TCC 1200 
Gly Glu His Asp Thr Thr Glu Tyr Lys Leu Arg Thr Leu Gin Val Ser 
385 390 395 400 

55 

CCG CTG GAC AAT GGA GAC CTG ATT CGG GAG ATC TGG CAT TAC CAG TAC 1248 
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1296 



1344 



1392 



1440 



1488 



1536 



222 

Pro Leu Asp Asn Gly Asp Leu lie Arg Glu lie Trp His Tyr Gin Tyr 
405 410 

CTG AGC TGG CCC GAC CAT GGG GTC CCC AGT GAG CCT GGG GGT GTC CTC 
Leu ser Trp Pro Asp His Gly Val Pro Ser Glu Pro Gly Gly Val Leu 
420 425 430 

AGC TTC CTG GAC CAG ATC AAC CAG CGG CAG GAA AGT CTG CCT CAC GCA 
Ser Phe Leu Asp Gin lie Asn Gin Arg Oln Glu Ser Leu Pro His Ala 
435 440 

GGG CCC ATC ATC GTG CAC TGC AGC GCC GGC ATC GGC CGC ACA GGC ACC 
Gly Pro He He Val His Cys Ser Ala Gly He Gly Arg Thr Gly Thr 

450 455 460 

ATC ATT GTC ATC GAC ATG CTC ATG GAG AAC ATC TCC ACC AAG GGC CTG 
He He Val He Asp Met Leu Met Glu Asn He Ser Thr Lys Gly Leu 
^" 470 475 480 

GAC TGT GAC ATT GAC ATC CAG AAG ACC ATC CAG ATG GTG CGG GCG CAG 
Asp Cys Asp He Asp He Gin Lys Thr He Gin Met Val Arg Ala Gin 
485 490 495 

CGC TCG GGC ATG GTG CAG ACG GAG GCG CAG TAC AAG TTC ATC TAG GTG 
Arg ser Gly Met Val Gin Thr Glu Ala Gin Tyr Lys Phe He Tyr Val 
500 505 510 

GCC ATC GCC CAG TTC ATT GAA ACC ACT AAG AAG AAG CTG GAG GTC CTG 1584 
Ala He Ala Gin Phe He Glu Thr Thr Lys Lys Lys Leu Glu Val Leu 
515 520 525 

CAG TCG CAG AAG GGC CAG GAG TCG GAG TAC GGG AAC ATC ACC TAT CCC 1632 
Gin ser Gin Lys Gly Gin Glu Ser Glu Tyr Gly Asn He Thr Tyr Pro 
530 535 540 

CCA GCC ATG AAG AAT GCC CAT GCC AAG GCC TCC CGC ACC TCG TCC AAA 1680 
Pro Ala Met Lys Asn Ala His Ala Lys Ala Ser Arg Thr Ser Ser Lys 
550 555 

CAC AAG GAG GAT GTG TAT GAG AAC CTG CAC ACT AAG AAC AAG AGG GAG 1728 
His Lys Glu Asp Val Tyr Glu Asn Leu His Thr Lys Asn Lys Arg Glu 
565 570 575 

GAG AAA GTG AAG AAG CAG CGG TCA GCA GAC AAG GAG AAG AGC AAG GGT 1776 
Glu Lys Val Lys Lys Gin Arg Ser Ala Asp Lys Glu Lys Ser Lys Gly 
580 585 590 

TCC CTC AAG AGG AAG CGA ATT CTG CAG TCG ACG GTA CCG CGG GCC CGG 1824 
Ser Leu Lys Arg Lys Arg He Leu Gin Ser Thr Val Pro Arg Ala Arg 
5^5 600 605 

GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC 1872 
Asp Pro Pro val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 

615 620 

GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC 192 0 
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Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
625 630 635 640 

AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG 1968 
5 Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

645 650 655 

CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG 2016 
Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
10 660 665 670 

CCC ACC CTC GTG ACC ACC CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC 2 064 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 
675 680 685 

15 

TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC 2112 
Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 
690 695 700 

20 GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC 2160 

Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
705 710 715 720 

TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC 220.8 
25 Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

725 730 735 

CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG 2256 
Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 
30 740 745 750 

GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG 23 04 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 
755 760 765 

35 

GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC 2352 
Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His 
770 775 780 

40 AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC 2400 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
785 790 795 800 

ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG 244 8 
45 Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

805 810 815 

AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC 2496 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys' Arg Asp His 
50 820 825 830 

ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG 254 4 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met 
835 840 845 



55 



GAC GAG CTG TAC AAG TAA 2562 
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Asp Glu Leu Tyr Lys 
850 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 853 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Met Leu ser Arg Gly Trp Phe His Arg Asp Leu Ser Gly Leu Asp Ala 

^5 10 15 

Glu Thr Leu Leu Lys Gly Arg Gly Val His Gly Ser Phe Leu Ala Arq 

25 30 
Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser Leu Ser Val Arg Val Gly 

35 40 45 

Asp Gin val Thr His He Arg He Gin Asn Ser Gly Asp Phe Tyr Asp 

55 60 
Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu Thr Glu Leu Val Glu Tyr 

5 70 75 

Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp Arg Asp Gly Thr He He 

85 90 35 

His Leu Lys Tyr Pro Leu Asn Cys Ser Asp Pro Thr Ser Glu Arg Trp 

100 105 110 

Tyr His Gly His Met Ser Gly Gly Gin Ala Glu Thr Leu Leu Gin Ala 

lis 120 125 

Lys Gly Glu Pro Trp Thr Phe Leu Val Arg Glu Ser Leu Ser Gin Pro 

"° 135 140 

Gly Asp Phe Val Leu Ser Val Leu Ser Asp Gin Pro Lys Ala Gly Pro 
It 155 160 

Gly Ser Pro Leu Arg Val Thr His He Lys Val Met Cys Glu Gly Gly 

165 170 175 

Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe Asp Ser Leu Thr Asp Leu 

180 185 190 

val Glu His Phe Lys Lys Thr Gly He Glu Glu Ala Ser Gly Ala Phe 

1^5 200 205 

Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr Arg Val Asn Ala Ala Asp 
Ti 220 
lie Glu Asn Arg Val Leu Glu Leu Asn Lys Lys Gin Glu Ser Glu Asp 
230 235 240 

Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe Glu Ser Leu Gin Lys Gin 

245 250 255 

Glu Val Lys Asn Leu His Gin Arg Leu Glu Gly Gin Arg Pro Glu Asn 

260 265 270 

Lys Gly Lys Asn Arg Tyr Lys Asn He Leu Pro Phe Asp His Ser Arg 

275 280 285 

Val He Leu Gin Gly Arg Asp Ser Asn He Pro Gly Ser Asp Tyr He 

295 300 
Asn Ala Asn Tyr He Lys Asn Gin Leu Leu Gly Pro Asp Glu Asn Ala 
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305 310 315 320 

Lys Thr Tyr He Ala Ser Gin Gly Cys Leu Glu Ala Thr Val Asn Asp 

325 330 335 

Phe Trp Gin Met Ala Trp Gin Glu Asn Ser Arg Val He Val Met Thr 
5 340 345 350 

Thr Arg Glu Val Glu Lys Gly Arg Asn Lys Cys Val Pro Tyr Trp Pro 

355 360 365 

Glu Val Gly Met Gin Arg Ala Tyr Gly Pro Tyr Ser Val Thr Asn Cys 
370 375 380 

10 Gly Glu His Asp Thr Thr Glu Tyr Lys Leu Arg Thr Leu Gin Val Ser 
385 390 395 400 

Pro Leu Asp Asn Gly Asp Leu He Arg Glu He Trp His Tyr Gin Tyr 

405 410 415 

Leu Ser Trp Pro Asp His Gly Val Pro Ser Glu Pro Gly Gly Val Leu 
15 420 425 430 

Ser Phe Leu Asp Gin He Asn Gin Arg Gin Glu Ser Leu Pro His Ala 

435 440 445 

Gly Pro He He Val His Cys Ser Ala Gly He Gly Arg Thr Gly Thr 
450 455 460 

20 He He Val He Asp Met Leu Met Glu Asn He Ser Thr Lys Gly Leu 
465 470 475 480 

Asp Cys Asp He Asp He Gin Lys Thr He Gin Met Val Arg Ala Gin 

485 490 495 

Arg Ser Gly Met Val Gin Thr Glu Ala Gin Tyr Lys Phe He Tyr Val 
25 500 505 510 

Ala He Ala Gin Phe He Glu Thr Thr Lys Lys Lys Leu Glu Val Leu 

515 520 525 

Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr Gly Asn He Thr Tyr Pro 
530 535 540 

30 Pro Ala Met Lys Asn Ala His Ala Lys Ala Ser Arg Thr Ser Ser Lys 
545 550 555 560 

His Lys Glu Asp Val Tyr Glu Asn Leu His Thr Lys Asn Lys Arg Glu 

565 570 575 

Glu Lys Val Lys Lys Gin Arg Ser Ala Asp Lys Glu Lys Ser Lys Gly 
35 580 585 590 

Ser Leu Lys Arg Lys Arg He Leu Gin Ser Thr Val Pro Arg Ala Arg 

595 600 605 

Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 
610 615 620 

40 Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
625 630 635 640 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

645 650 655 

Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
45 660 665 670 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 

675 680 685 

Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 
690 695 700 

50 Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
705 710 715 720 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

725 730 735 

Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
55 740 745 750 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
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c c 




Ala 


Asp 


Lys 


Gin Lys 




770 






Asn 


lie 


Glu 


Asp Gly 


785 






Thr 


Pro 


lie 


Gly Asp 








805 


Ser 


Thr 


Gin 


Ser Ala 








820 


Met 


Val 


Leu 


Leu Glu 






835 




Asp 


Glu 


Leu 


Tyr Lys 




850 







760 



765 



775 



780 



790 



795 



800 



810 



815 



825 



830 



840 



845 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 994 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2991 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Met ?S Jer SJs T rf "^"^ '^'^ ^-"^ ATC CTG 

Met val ser Lye Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

5 10 15 

vll etf f ™ ^^'^ AGC GTG TCC G6C 

val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val str Ity 

X s ir. ^; z s ^. ^ - 

40 



45 



^r S IT r° r° "^"^ ^^-^ ^'^'^ ACC ACC 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 



55 60 



Su ?S Sr G?v SIl of ^ T ^^'^ ^™ 

^eu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



70 

'5 80 



Sn His ASP Se SJe T ^''^ "^^^ ^^^^ 

Has Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



90 95 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 



46 



96 



144 



192 



240 



288 



336 

226 
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Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
5 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
10 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

15 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

20 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
25 Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG T^G CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys hrg Asp His Met Val Leu Leu Glu Phe 
30 210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

35 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG ACC ATG GAG CGG CCC 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Glu Arg Pro 
245 250 255 

40 CCG GGG CTG CGG CCG GGC GCG GGC GGG CCC TGG GAG ATG CGG GAG CGG 816 
Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu Met Arg Glu Arg 
260 265 270 

CTG GGC ACC GGC GGC TTC GGG AAC GTC TGT CTG TAC CAG CAT CGG GAA 864 
45 Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr Gin His Arg Glu 
275 280 285 

CTT GAT CTC AAA ATA GCA ATT AAG TCT TGT CGC CTA GAG CTA AGT ACC 912 
Leu Asp Leu Lys He Ala He Lys Ser Cys Arg Leu Glu Leu Ser Thr 
50 290 295 3O0 

AAA AAC AGA GAA CGA TGG TGC CAT GAA ATC CAG ATT ATG AAG AAG TTG 960 
Lys Asn Arg Glu Arg Trp Cys His Glu He Gin He Met Lys Lys Leu 
305 310 315 320 



55 



AAC CAT GCC AAT GTT GTA AAG GCC TGT GAT GTT CCT GAA GAA TTG AAT 1008 

227 
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15 



20 



40 



50 



55 
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Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro Glu Glu Leu Asn 



325 



335 



ATT TTG ATT CAT GAT GTG CCT CTT CTA GCA ATG GAA TAC TGT TCT GGA 
He Leu lie His Asp Val Pro Leu Leu Ala Met Glu Tyr Cys ^ 



345 350 



GlJ til ^ll? f '^^^ TOT GGA CTT 

Gly ASP Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn Cys Cys Gly Leu 
355 360 

AAA GAA AGC CAG ATA CTT TCT TTA CTA AGT GAT ATA GGG TCT GGG ATT 
Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He Gly Ser J" 

375 380 

Arp JJr H^"" rf" '^'^ '^^^ ^ CCT GAA 

Arg Tyr Leu His Glu Asn Lys He He His Arg Asp Leu Lys Pro Glu 

395 400 

AAC ATA GTT CTT CAG GAT GTT GGT GGA AAG ATA ATA CAT AAA ATA ATT 
Asn He val Leu Gin Asp Val Gly Gly Lys He He Hil ^ He xZ 

410 

GAT CTG GGA TAT GCC AAA GAT GTT GAT CAA GGA AGT CTG TGT ACA TCT 
25 ASP Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser Leu Cys Ser 
420 425 430 

pJI ilt flv ?Sr r° nf ^= ^ 

Phe val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu Phe Glu Asn Lys 



440 



445 



CCT TAC ACA GCC ACT GTT GAT TAT TCG AGC TTT GGG ACC ATG GTA TTT 
Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly Thr Zt vli SI 



35 460 



CVS ill CCA TTT 

Glu cys He Ala Gly Tyr Arg Pro Phe Leu His His Leu Gin Pro Phe 



470 



475 480 



Tr. T^ ^! ^° CCA AAG TGT ATA TTT GCA 



Thr- TT-n r.i„ t -m "^^^ TTT GCA 

Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys Cys He Phe Ala 

490 495 



cys Glu Olu Met ser Gly Glu Val A^g ^ His L.. l7o .Tn 

"° 505 510 



Irt c , """^ '''''' ^^"^ CCC ATG GAA AAC TGG CTA 

Pro Asn ser Leu Cys Ser Leu He Val Glu Pro Met Glu Asn Trp Leu 

520 525 

CAG TTG ATG TTG AAT TGG GAC CCT CAG CAG AGA GGA GGA CCT GTT GAC 
Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly Gly Pro vH Asp 

540 

CTT ACT TTG AAG CAG CCA AGA TGT TTT GTA TTA ATG GAT CAC ATT TTG 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1486 



485 

45 Jvl G^ ^ ^GC cat TTA CCT CAA 153S 



1584 



1632 



1680 
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Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met Asp His lie Leu 
545 550 555 560 

AAT TTG AAG ATA GTA CAC ATC CTA AAT ATG ACT TCT GCA AAG ATA ATT 1728 
5 Asn Leu Lys lie Val His lie Leu Asn Met Thr Ser Ala Lys lie lie 

565 570 575 

TCT TTT CTG TTA CCA CCT GAT GAA ACT CTT CAT TCA CTA CAG TCT CGT 1776 
Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser Leu Gin Ser Arg 
10 580 585 590 

ATT GAG CGT GAA ACT GGA ATA AAT ACT GGT TCT CAA GAA CTT CTT TCA 1824 
lie Glu Arg Glu Thr Gly lie Asn Thr Gly Ser Gin Glu Leu Leu Ser 
595 600 605 

15 

GAG AC A GGA ATT TCT CTG GAT CCT CGG AAA CCA GCC TCT CAA TGT GTT 1872 
Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala Ser Gin Cys Val 
610 615 620 

20 CTA GAT GGA GTT AGA GGC TGT GAT AGC TAT ATG GTT TAT TTG TTT GAT 1920 
Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val Tyr Leu Phe Asp 
625 630 635 640 

AAA AGT AAA ACT GTA TAT GAA GGG CCA TTT GCT TCC AGA AGT TTA TCT 1968 
25 Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser Arg Ser Leu Ser 

645 650 655 

GAT TGT GTA AAT TAT ATT GTA CAG GAC AGC AAA ATA CAG CTT CCA ATT 2016 
Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys lie Gin Leu Pro He 
30 660 665 670 

ATA CAG CTG CGT AAA GTG TGG GCT GAA GCA GTG CAC TAT GTG TCT GGA 2 064 

He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His Tyr Val Ser Gly 
675 680 685 

35 

CTA AAA GAA GAC TAT AGC AGG CTC TTT CAG GGA CAA AGG GCA GCA ATG 2112 
Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin Arg Ala Ala Met 
690 695 700 

40 TTA AGT CTT CTT AGA TAT AAT GCT AAC TTA ACA AAA ATG AAG AAC ACT 2160 

Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys Met Lys Asn Thr 
705 710 715 720 

TTG ATC TCA GCA TCA CAA CAA CTG AAA GCT AAA TTG GAG TTT TTT CAC 2208 
45 Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu Glu Phe Phe His 

725 730 735 

AAA AGC ATT CAG CTT GAC TTG GAG AGA TAC AGC GAG CAG ATG ACG TAT 2256 
Lys Ser He Gin Leu Asp Leu Glu Arg Tyr Ser Glu Gin Met Thr Tyr 
50 740 745 750 

GGG ATA TCT TCA GAA AAA ATG CTA AAA GCA TGG AAA GAA ATG GAA GAA 2304 
Gly He Ser Ser Glu Lys Met Leu Lys Ala Trp Lys Glu Met Glu Glu 
755 760 765 



55 



AAG GCC ATC CAC TAT GCT GAG GTT GGT GTC ATT GGA TAC CTG GAG GAT 2352 
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Lys Ala He His Tyr Ala Glu Val Gly Val He Gly Tyr Leu Glu Asp 

77S 780 

tTI m!^ I" 7° '^^^ C-^^ AGC CCC 2400 

Gin He Met Ser Leu His Ala Glu He Met Gly Leu Gin Lys Ser Pro 

■'SO 795 800 

TAT GGA AGA CGT CAG GGA GAC TTG ATG GAA TCT CTG GAA CAG CGT GCC 2448 
Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu Glu Gin Arg Ala 

810 ei5 

ATT GAT CTA TAT AAG CAG TTA AAA CAC AGA CCT TCA GAT CAC TCC TAG 2496 
He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser Asp His Ser Tyr 
820 825 830 



AGT GAC AGO ACA GAG ATG GTG AAA ATC ATT GTG CAC ACT GTG CAG AGT 2544 
ser Asp ser Thr Glu Met Val Lys He He Val His Thr Val Gin Ser 
835 840 845 



CAG GAC CGT GTG CTC AAG GAG CTG TTT GGT CAT TTG AGC AAG TTG TTG 2592 
Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu Ser Lys Leu Leu 
"° 855 860 



GGC TGT AAG CAG AAG ATT ATT GAT CTA CTC CCT AAG GTG GAA GTG GCC 2640 
Gly Cys Lys Gin Lys He He Asp Leu Leu Pro Lys Val Glu Val Ala 

"0 875 880 



CTC AGT AAT ATC AAA GAA GCT GAC AAT ACT GTC ATG TTC ATG CAG GGA 2688 
Leu ser Asn He Lys Glu Ala Asp Asn Thr Val Met Phe Met Gin Gly 

890 895 

AAA AGG CAG AAA GAA ATA TGG CAT CTC CTT AAA ATT GCC TGT ACA CAG 2736 
Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He Ala Cys Thr Gin 

905 910 

AGT TCT GCC CGC TCT CTT GTA GGA TCC AGT CTA GAA GGT GCA GTA ACC 2784 
ser Ser Ala Arg Ser Leu Val Gly ser Ser Leu Glu Gly Ala Val Thr 
915 920 325 

PrI rf """^ '''''' ^'^^ CAT GAT CAT 2832 

Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala Glu His Asp His 

935 940 

TCT CTG TCA TGT GTG GTA ACT CCT CAA GAT GGG GAG ACT TCA GCA CAA 
ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu Thr Ser Ala Gin 

3=0 955 960 

m!^ tT"" °f "^"^ "^"^ """^ TTA AGC ACT ATT ATT 2 928 

Met He Glu Glu Asn Leu Asn Cys Leu Gly His Leu Ser Thr He He 
3^5 970 975 

CAT GAG GCA AAT GAG GAA CAG GGC AAT AGT ATG ATG AAT CTT GAT TGG 2976 
Hxs Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met Asn Leu Asp Trp 

AGT TGG TTA ACA GAA TGA 



580 9^85 990 



2880 



2994 
230 
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Ser Trp Leu Thr Glu 
995 



5 (2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 997 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 





Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly Val 


Val 


Pro 


He 


Leu 




1 








5 










10 










15 




20 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 










20 










25 










30 








Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 








35 










40 










45 










Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 


25 




50 










55 










60 












Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 




65 










70 










75 










60 




Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 












85 










90 










95 




30 


Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 


Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 










100 










105 










110 








Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 








115 










120 










125 










He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 


35 




130 










135 










140 












Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 




145 










150 










155 










160 




Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 












165 










170 










175 




40 


Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 










180 










185 










190 








Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 








195 










200 










205 










Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 


45 




210 










215 










220 












Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 




225 










230 










235 










240 




Gly 


Leu 


Arg 


Ser 


Arg 


Ala 


Gin 


Ala 


Ser 


Asn 


Ser 


Thr 


Met 


Glu 


Arg 


Pro 












245 










250 










255 




50 


Pro 


Gly 


Leu 


Arg 


Pro 


Gly 


Ala 


Gly 


Gly 


Pro 


Trp 


Glu 


Met 


Arg 


Glu 


Arg 










260 










265 










270 








Leu 


Gly 


Thr 


Gly 


Gly 


Phe 


Gly 


Asn 


Val 


Cys 


Leu 


Tyr 


Gin 


His 


Arg 


Glu 








275 










280 










285 










Leu 


Asp 


Leu 


Lys 


He 


Ala 


He 


Lys 


Ser 


Cys 


Arg 


Leu 


Glu 


Leu 


Ser 


Thr 


55 




290 










295 










300 












Lys 


Asn 


Arg 


Glu 


Arg 


Trp 


Cys 


His 


Glu 


He 


Gin 


He 


Met 


Lys 


Lys 


Leu 
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305 

Asn His Ala Asn Val Val Lys Ala Cys Asp III Pro Glu Glu Leu A^n 

He Leu He His Asp Val Pro Leu Leu Ala Met Glu Tyr Cys Ser Gly 
340 



Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn Cys "s Gly Leu 



Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He Gly Ser Gly He 

Arg Tyr Leu His Glu Asn Lys He He His Arg Asp Leu Lys Pro Glu 

Asn He val Leu Gin Asp Val Gly Gly Lys He He His Lys He He 
405 



Asp Leu Gly Tyr Ala Lys Asp Val Asp cJn Gly Ser Leu Cys ill Ser 

Phe Val Gly Thr Leu Gin Tyr Leu 111 Pro Glu Leu Phe Glu Asn Lys 

440 

Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly Thr Met Val Phe 

455 460 
Glu cys He Ala Gly Tyr Arg Pro Phe Leu His His Leu Gin Pro Phe 
470 

Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys Cys He Phe III 
^ -. 490 495 

Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser His Leu Pro Gin 

505 510 
Pro Asn Ser Leu Cys Ser Leu He Val Glu Pro Met Glu Asn Trp Leu 

520 525 
Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly Gly Pro Val Asp 

Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu M^t Asp His He Leu 

Asn Leu Lys He Val His He Leu Asn Met Thr Ser Ala Lys He He 

570 575 
Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser Leu Gin Ser Arg 

585 590 
He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin Glu Leu Leu Ser 

595 600 605 

Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala Ser Gin Cys Val 

620 

Leu ASP Gly val Arg Gly Cys Asp Ser Tyr Met Val Tyr Leu Phe Asp 

Lys ser Lys Thr Val Tyr Glu Gly Pro Phe III Ser Arg Ser Leu Ser 

645 650 65^ 

ASP Cys val Asn Tyr He Val Gin Asp Ser Lys He Gin Leu Pro He 

ooO 



He Gin Leu Arg Lys Val Trp Ala g" Ala Val His Tyr vll Ser Gly 

680 685 
Leu Lys Glu ASP Tyr Ser Arg Leu Phe Gin Gly Gin Arg Ala Ala Met 

695 

Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys Met Lys Asn Thr 

Leu He ser Ala Ser Gin Gin Leu Lys Ala Lys Leu Glu Phe Phe His 

Lys ser He Gin Leu Asp Leu Glu Arg ijr Ser Glu Gin Met III Tyr 

0 745 
Gly He ser Ser Glu Lys Met Leu Lys Ala Trp Lys Glu Me^ Glu Glu 
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755 760 765 

Lys Ala lie His Tyr Ala Glu Val Gly Val lie Gly Tyr Leu Glu Asp 

770 775 780 

Gin lie Met Ser Leu His Ala Glu lie Met Gly Leu Gin Lys Ser Pro 
5 785 790 795 800 

Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu Glu Gin Arg Ala 

805 810 815 

lie Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser Asp His Ser Tyr 
820 825 830 

10 Ser Asp Ser Thr Glu Met Val Lys lie lie Val His Thr Val Gin Ser 
835 840 845 

Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu Ser Lys Leu Leu 

850 855 860 • 

Gly Cys Lys Gin Lys lie lie Asp Leu Leu Pro Lys Val Glu Val Ala 
15 865 870 875 880 

Leu Ser Asn lie Lys Glu Ala Asp Asn Thr Val Met Phe Met Gin Gly 

885 890 895 

Lys Arg Gin Lys Glu lie Trp His Leu Leu Lys lie Ala Cys Thr Gin 
900 905 910 

20 Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu Gly Ala Val Thr 
915 920 925 

Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala Glu His Asp His 

930 935 940 

Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu Thr Ser Ala Gin 
25 945 950 955 960 

Met lie Glu Glu Asn Leu Asn Cys Leu Gly His Leu Ser Thr lie lie 

965 970 975 

His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met Asn Leu Asp Trp 
980 985 990 

30 Ser Trp Leu Thr Glu 
995 

(2) INFORMATION FOR SEQ ID NO: 12 2: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
45 (B) LOCATION: 1...2988 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

50 ATG GAG CGG CCC CCG GGG CTG CGG CCG GGC GCG GGC GGG CCC TGG GAG 4 8 

Met Glu Arg Pro Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu 
15 10 15 

ATG CGG GAG CGG CTG GGC ACC GGC GGC TTC GGG AAC GTC TGT CTG TAC 96 
55 Met Arg Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr 
20 25 30 
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CAG CAT CGG GAA CTT GAT CTC AAA ATA GCA ATT AAG TCT TGT CGC CTA 144 
Gin His Arg Glu Leu Asp Leu Lys He Ala He Lys Ser Cys Arg Leu 
35 40 45 

GAG CTA AGT ACC AAA AAC AGA GAA CGA TGG TGC CAT GAA ATC CAG ATT 192 
Glu Leu Ser Thr Lys Asn Arg Glu Arg Trp Cys His Glu He Gin He 
50 55 60 

ATG AAG AAG TTG AAC CAT GCC AAT GTT GTA AAG GCC TGT GAT GTT CCT 240 
Met Lys Lys Leu Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro 
^5 70 75 80 

GAA GAA TTG AAT ATT TTG ATT CAT GAT GTG CCT CTT CTA GCA ATG GAA 2 88 

Glu Glu Leu Asn He Leu He His Asp Val Pro Leu Leu Ala Met Glu 
85 90 95 

TAC TGT TCT GGA GGA GAT CTC CGA AAG CTG CTC AAC AAA CCA GAA AAT 336 
Tyr Cys Ser Gly Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn 
100 105 110 

TGT TGT GGA CTT AAA GAA AGC CAG ATA CTT TCT TTA CTA AGT GAT ATA 3 84 

Cys Cys Gly Leu Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He 
115 120 125 

GGG TCT GGG ATT CGA TAT TTG CAT GAA AAC AAA ATT ATA CAT CGA GAT 432 
Gly Ser Gly He Arg Tyr Leu His Glu Asn Lys He He His Arg Asp 
130 135 140 

CTA AAA CCT GAA AAC ATA GTT CTT CAG GAT GTT GGT GGA AAG ATA ATA 480 
Leu Lys Pro Glu Asn He Val Leu Gin Asp Val Gly Gly Lys He He 

150 155 160 

CAT AAA ATA ATT GAT CTG GGA TAT GCC AAA GAT GTT GAT CAA GGA AGT 52 8 

His Lys He He Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser 
165 170 175 

CTG TGT ACA TCT TTT GTG GGA ACA CTG CAG TAT CTG GCC CCA GAG CTC 576 
Leu Cys Thr Ser Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu 
180 185 190 

TTT GAG AAT AAG CCT TAC ACA GCC ACT GTT GAT TAT TGG AGC TTT GGG 624 
Phe Glu Asn Lys Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly 
195 200 205 

ACC ATG GTA TTT GAA TGT ATT GCT GGA TAT AGG CCT TTT TTG CAT CAT 672 
Thr Met Val Phe Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His 
210 215 220 

CTG CAG CCA TTT ACC TGG CAT GAG AAG ATT AAG AAG AAG GAT CCA AAG 72 0 

Leu Gin Pro Phe Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys 
225 230 235 240 

TGT ATA TTT GCA TGT GAA GAG ATG TCA GGA GAA GTT CGG TTT AGT AGC 768 
Cys He Phe Ala Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser 
245 250 255 



234 



SUBSTITUTE SHEFT (RULE 26) 



wo 98/45704 PCT/DK98/00145 

235 

CAT TTA CCT CAA CCA AAT AGC CTT TGT AGT TTA ATA GTA GAA CCC ATG 816 
His Leu Pro Gin Pro Asn Ser Leu Cys Ser Leu He Val Glu Pro Met 
260 265 270 

5 

GAA AAC TGG CTA GAG TTG ATG TTG AAT TGG GAG CCT CAG CAG AGA GGA 864 
Glu Asn Trp Leu Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly 
275 280 285 

10 GGA CCT GTT GAC CTT ACT TTG AAG CAG CCA AGA TGT TTT GTA TTA ATG 912 
Gly Pro Val Asp Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met 
290 295 300 

GAT CAC ATT TTG AAT TTG AAG ATA GTA CAC ATC CTA AAT ATG ACT TCT 960 
15 Asp His He Leu Asn Leu Lys He Val His He Leu Asn Met Thr Ser 
305 310 315 320 

GCA AAG ATA ATT TCT TTT CTG TTA CCA CCT GAT GAA AGT CTT CAT TCA 1008 
Ala Lys He He Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser 
20 325 330 335 

CTA CAG TCT CGT ATT GAG CGT GAA ACT GGA ATA AAT ACT GGT TCT CAA 1056 

Leu Gin Ser Arg He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin 

340 345 350 

25 

GAA CTT CTT TCA GAG ACA GGA ATT TCT CTG GAT CCT CGG AAA CCA GCC 1104 

Glu Leu Leu Ser Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala 
355 360 365 

30 TCT CAA TGT GTT CTA GAT GGA GTT AGA GGC TGT GAT AGC TAT ATG GTT 1152 
Ser Gin Cys Val Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val 
370 375 380 

TAT TTG TTT GAT AAA AGT AAA ACT GTA TAT GAA GGG CCA TTT GCT TCC 1200 
35 Tyr Leu Phe Asp Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser 
385 390 395 400 

AGA AGT TTA TCT GAT TGT GTA AAT TAT ATT GTA CAG GAC AGC AAA ATA 124 8 

Arg Ser Leu Ser Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He 
40 405 410 415 

CAG CTT CCA ATT ATA CAG CTG CGT AAA GTG TGG GCT GAA GCA GTG CAC 1296 

Gin Leu Pro He He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His 
420 425 430 

45 

TAT GTG TCT GGA CTA AAA GAA GAC TAT AGC AGG CTC TTT CAG GGA CAA 1344 

* Tyr Val Ser Gly Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin 
435 440 445 

50 AGG GCA GCA ATG TTA AGT CTT CTT AGA TAT AAT GCT AAC TTA ACA AAA 13 92 

Arg Ala Ala Met Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys 
450 455 460 

ATG AAG AAC ACT TTG ATC TCA GCA TCA CAA CAA CTG AAA GCT AAA TTG 144 0 

55 Met Lys Asn Thr Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu 
465 470 475 480 
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GAG TTT TTT CAC AAA AGC ATT CAG CTT GAC TTG GAG AGA TAG AGC GAG 14 88 

Glu Phe Phe His Lys Ser lie Gin Leu Asp Leu Glu Arg Tyr Ser Glu 
485 490 495 

CAG ATG ACG TAT GGG ATA TOT TCA GAA AAA ATG CTA AAA GCA TGG AAA 1536 
Gin Met Thr Tyr Gly lie Ser Ser Glu Lys Met Leu Lys Ala Trp Lys 
500 505 510 

GAA ATG GAA GAA AAG GCC ATC CAC TAT GCT GAG GTT GGT GTC ATT GGA 1584 
Glu Met Glu Glu Lys Ala lie His Tyr Ala Glu Val Gly Val lie Gly 
515 520 525 

TAG CTG GAG GAT CAG ATT ATG TCT TTG CAT GCT GAA ATC ATG GGG CTA 1632 
Tyr Leu Glu Asp Gin He Met Ser Leu His Ala Glu He Met Gly Leu 
530 535 540 



CAG AAG AGC CCC TAT GGA AGA CGT CAG GGA GAC TTG ATG GAA TCT CTG 1680 
Gin Lys Ser Pro Tyr Gly Arg T^g Gin Gly Asp Leu Met Glu Ser Leu 
20 545 550 555 560 

GAA CAG CGT GCC ATT GAT CTA TAT AAG CAG TTA AAA CAC AGA CCT TCA 1728 
Glu Gin Arg Ala He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser 
565 570 575 



GAT CAC TCC TAG AGT GAC AGC ACA GAG ATG GTG AAA ATC ATT GTG CAC 1776 
Asp His Ser Tyr Ser Asp Ser Thr Glu Met Val Lys He He Val His 
580 585 590 

ACT GTG CAG AGT CAG GAC CGT GTG CTC AAG GAG CTG TTT GGT CAT TTG 1824 
Thr Val Gin Ser Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu 
595 600 605 

AGC AAG TTG TTG GGC TGT AAG CAG AAG ATT ATT GAT CTA CTC CCT AAG 1872 
Ser Lys Leu Leu Gly Cys Lys Gin Lys He He Asp Leu Leu Pro Lys 
^10 615 620 



GTG GAA GTG GCC CTC AGT AAT ATC AAA GAA GCT GAC AAT ACT GTC ATG 1920 
val Glu val Ala Leu Ser Asn He Lys Glu Ala Asp Asn Thr Val Met 
^0 ^25 630 635 640 

TTC ATG CAG GGA AAA AGG CAG AAA GAA ATA TGG CAT CTC CTT AAA ATT 1968 
Phe Met Gin Gly Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He 
645 650 655 



GCC TGT ACA CAG AGT TCT GCC CGC TCT CTT GTA GGA TCC AGT CTA GAA 2016 
Ala Cys Thr Gin Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu 
660 665 670 

GGT GCA GTA ACC CCT CAG ACA TCA GCA TGG CTG CCC CCG ACT TCA GCA 2064 
Gly Ala Val Thr Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala 
675 680 685 

GAA CAT GAT CAT TCT CTG TCA TGT GTG GTA ACT CCT CAA GAT GGG GAG 2112 
Glu His Asp His Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu 
690 695 700 
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ACT TCA GCA CAA ATG ATA GAA GAA AAT TTG AAC TGC CTT GGC CAT TTA 2160 

Thr Ser Ala Gin Met He Glu Glu Asn Leu Asn Cys Leu Gly His Leu 
705 710 715 720 

5 

AGC ACT ATT ATT CAT GAG GCA AAT GAG GAA CAG GGC AAT AGT ATG ATG 2208 

Ser Thr He He His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met 
725 730 735 

10 AAT CTT GAT TGG AGT TGG TTA ACA GAA TGG GTA CCG CGG GCC CGG GAT 2256 
Asn Leu Asp Trp Ser Trp Leu Thr Glu Trp Val Pro Arg Ala Arg Asp 
740 745 750 

CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 23 04 
15 Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
755 760 765 

GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 2352 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
20 770 775 780 

TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG 24 00 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
785 790 795 800 

25 

ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 2448 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
805 810 815 

30 ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC 2496 
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 
820 825 830 

CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 2544 
35 Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
835 840 845 

GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 2592 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
40 850 855 860 

AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC 2640 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

865 870 875 880 

45 

ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 2688 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

885 890 895 

50 CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 2736 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
900 905 910 

GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 27 84 
55 Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
915 920 925 
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ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 2832 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 

935 940 

CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC 2 880 
Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

550 955 960 

ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG 2928 
Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
565 970 

GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 2976 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

985 990 



GAG CTG TAC AAG TAA 
Glu Leu Tyr Lys 
995 



2991 



(2) INFORMATION FOR SEQ ID NO:123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 996 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

Met Glu Arg Pro Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu 

^ 10 15 

Met Arg Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr 

20 25 30 

Gin His Arg Glu Leu Asp Leu Lys lie Ala He Lys Ser Cys Arg Leu 

35 40 45 

Glu Leu ser Thr Lys Asn Arg Glu Arg Trp Cys His Glu He Gin He 
« 60 

Met Lys Lys Leu Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro 

70 75 
Glu Glu Leu Asn He Leu He His Asp Val Pro Leu Leu Ala Met Glu 

85 90 95 

Tyr Cys Ser Gly Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn 

105 110 
Cys Cys Gly Leu Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He 

115 120 125 

Gly Ser Gly He Arg Tyr Leu His Glu Asn Lys He He His Arg Asp 
130 135 

Leu Lys Pro Glu Asn He Val Leu Gin Asp Val Gly Gly Lys He He 

155 igo 
His Lys He He Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser 
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170 










175 






Leu 


Cys 


Thr 


Ser 
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Val 


Gly 
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Gin 


Tyr 
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Pro 


Glu 
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185 










190 








Phe 


Glu 


Asn 


Lys 


Pro 


Tyr 


Thr 


Ala 


Thr 


Val 


Asp 


Tyr 


Trp 


Ser 


Phe 


Gly 
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195 










200 










205 










Thr 


Met 


Val 


Phe 


Glu 


Cys 


He 


Ala 


Gly Tyr 


Arg 


Pro 


Phe 


Leu 


His 


His 






210 










215 










220 
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Gin 


Pro 


Phe 


Thr 


Trp 


His 


Glu 


Lys 


He 


Lys 


Lys 


Lys 


Asp 


Pro 


Lys 




225 
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10 


Cys 
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Cys 


Glu 


Glu 


Met 


Ser Gly 


Glu 


Val 


Arg 


Phe 


Ser 


Ser 
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Leu 


Cys 
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Leu 
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Val 


Glu 


Pro 


Met 
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270 








Glu 


Asn 


Trp 


Leu 


Gin 


Leu 


Met 


Leu 
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Trp 


Asp 


Pro 


Gin 


Gin 


Arg 


Gly 


15 
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Pro 


Val 


Asp 


Leu 


Thr 
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Gin 


Pro 


Arg 
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Phe 


Val 


Leu 
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His 


lie 
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Lys 


He 


Val 


His 


He 
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Met 


Thr 


Ser 
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20 


Ala 


Lys 


He 


He 


Ser 


Phe 


Leu 


Leu 


Pro 


Pro 


Asp 


Glu 


Ser 


Leu 


His 


Ser 












325 










330 










335 






Leu 


Gin 


Ser 


Arg 


He 


Glu 


Arg 


Glu 


Thr 


Gly 


He 


Asn 


Thr 


Gly 


Ser 


Gin 










340 










345 
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Glu 
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Leu 


Ser 


Glu 


Thr Gly 


He 


Ser 


Leu 


Asp 


Pro 


Arg 


Lys 


Pro 


Ala 


25 
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Gin 


Cys 


Val 


Leu 


Asp 


Gly Val 
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Gly 


Cys 


Asp 


Ser 


Tyr 


Met 


Val 
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Lys 
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Thr 
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Glu Gly 
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He 
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470 
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40 
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490 










495 
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Thr 
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He 
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Met 
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Trp 
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500 
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Met 


Glu 
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Lys 
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He 


His 


Tyr 
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Glu 


Val 


Gly Val 


He 


Gly 
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610 615 620 

Val Glu Val Ala Leu Ser Asn He Lys Glu Ala Asp Asn Thr Val Met 

630 635 640 

Phe Met Gin Gly Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He 

€45 650 655 

Ala Cys Thr Gin Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu 

660 665 670 

Gly Ala Val Thr Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala 

675 680 685 

Glu His Asp His Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu 

690 695 700 

Thr Ser Ala Gin Met He Glu Glu Asn Leu Asn Cys Leu Gly His Leu 
705 710 715 720 

Ser Thr He He His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met 

725 730 735 

Asn Leu Asp Trp Ser Trp Leu Thr Glu Trp Val Pro Arg Ala Arg Asp 

740 745 
Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 

755 760 765 

Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

770 775 780 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
■^^5 790 795 800 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
25 805 810 815 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

820 825 830 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
835 840 845 

30 Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
850 855 860 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

870 875 880 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

885 890 895 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 

900 905 910 

Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
915 920 925 

40 He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
930 935 940 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

950 955 960 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
45 965 970 975 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

980 985 990 

Glu Leu Tyr Lys 
995 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix) FEATURE : 

5 

(A) NANE/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1905 
(D) OTHER INFORMATION: 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

20 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
25 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
30 65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

35 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

40 GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

45 lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
50 145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 



55 
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Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 



190 



Pro Su IT p'''' f ^^"^ ^'^^ '^C'^ ^ TCC GCC CTG 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TCC 
val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr sir 
"° 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCC ATG AGC GAG ACG GTC ATC ATG 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ser Glu Thr Val He 
245 250 255 

ser liu ?hr vl? tr ""^^ '^^^ GAT 

ser Glu Thr Val He Cys Ser Ser Arg Ala Thr Val Met Leu Tyr Asp 

260 265 270 

GAT GGC AAC AAG CGA TGG CTC COT GCT GGC ACG GGT CCC CAG GCC TTC 
ASP Gly Asn Lys Arg Trp Leu Pro Ala Gly Thr Gly Pro Gin Ala Phe 
275 280 285 

AGC CGC GTC CAG ATC TAC CAC AAC CCC ACG GCC AAT TCC TTT CGC GTC 
ser Arg Val Gin He Tyr His Asn Pro Thr Ala Asn Ser S ^ll 

290 295 

GTG GGC CGG AAG ATG CAG CCC GAC CAG CAG GTG GTC ATC AAC TGT GCC 
val Gly Arg Lys Met Gin Pro Asp Gin Gin Val Val He Asn Cys Ala 

315 320 

ill vTl Aro f ^ ™^ '^'^ ^'^'^ ^'^'^ CAT CAG 

He val Arg Gly Val Lys Tyr Asn Gin Ala Thr Pro Asn Phe His Gin 

330 335 

S A« Asn All f ""^^ ^C GGC AGC AAG GAG 

Trp Arg Asp Ala Arg Gin Val Trp Gly Leu Asn Phe Gly Ser Lys Glu 

345 350 

Zl til A?"" T ^"^"^ A<^T <^CC CTA GAG GCG TTG 

Asp Ala Ala Gin Phe Ala Ala Gly Met Ala Ser Ala Leu Glu Ala Leu 



355 



365 



G?v r7 '''''' ''''^ ^'^'^ "^^^ TCG GTC 

Glu Gly Gly Gly Pro Pro Pro Pro Pro Ala Leu Pro Thr Trp Ser Val 

375 

Pro G?*" p'''' f ''''^ ^""'^ ""^^ ^ CAG CAG 

Pro Asn Gly Pro Ser Pro Glu Glu Val Glu Gin Gin Lys Arg Gin Gin 

395 

CCC GGC CCG TCG GAG CAC ATA GAG CGC CGG GTC TCC AAT GCA GGA GGC 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 
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Pro Gly Pro Ser Glu His lie Glu Arg Arg Val Ser Asn Ala Gly Gly 
405 410 415 

CCA CCT OCT CCC CCC GCT GGG GGT CCA CCC CCA CCA CCA GGA CCT CCC 1296 
5 Pro Pro Ala Pro Pro Ala Gly Gly Pro Pro Pro Pro Pro Gly Pro Pro 
420 425 430 

CCT CCT CCA GGT CCC CCC CCA CCC CCA GGT TTG CCC CCT TCG GGG GTC 1344 
Pro Pro Pro Gly Pro Pro Pro Pro Pro Gly Leu Pro Pro Ser Gly Val 
10 435 440 445 

CCA GCT GCA GCG CAC GGA GCA GGG GGA GGA CCA CCC CCT GCA CCC CCT 1392 

Pro Ala Ala Ala His Gly Ala Gly Gly Gly Pro Pro Pro Ala Pro Pro 

450 455 460 

15 

CTC CCG GCA GCA CAG GGC CCT GGT GGT GGG GGA GCT GGG GCC CCA GGC 144 0 

Leu Pro Ala Ala Gin Gly Pro Gly Gly Gly Gly Ala Gly Ala Pro Gly 
465 470 475 480 

20 CTG GCC GCA GCT ATT GCT GGA GCC AAA CTC AGG AAA GTC AGC AAG CAG 14 88 

Leu Ala Ala Ala lie Ala Gly Ala Lys Leu Arg Lys Val Ser Lys Gin 
485 490 495 

GAG GAG GCC TCA GGG GGG CCC ACA GCC CCC AAA GCT GAG AGT GGT CGA 1536 
25 Glu Glu Ala Ser Gly Gly Pro Thr Ala Pro Lys Ala Glu Ser Gly Arg 
500 505 510 

AGC GGA GGT GGG GGA CTC ATG GAA GAG ATG AAC GCC ATG CTG GCC CGG 1584 
Ser Gly Gly Gly Gly Leu Met Glu Glu Met Asn Ala Met Leu Ala Arg 
30 515 520 525 

AGA AGG AAA GCC ACG CAA GTT GGG GAG AAA ACC CCC AAG GAT GAA TCT 1632 

Arg Arg Lys Ala Thr Gin Val Gly Glu Lys Thr Pro Lys Asp Glu Ser 
530 535 540 

35 

GCC AAT CAG GAG GAG CCA GAG GCC AGA GTC CCG GCC CAG AGT GAA TCT 1680 

Ala Asn Gin Glu Glu Pro Glu Ala Arg Val Pro Ala Gin Ser Glu Ser 
545 550 555 560 

40 GTG CGG AGA CCC TGG GAG AAG AAC AGC ACA ACC TTG CCA AGG ATG AAG 1728 

Val Arg Arg Pro Trp Glu Lys Asn Ser Thr Thr Leu Pro Arg Met Lys 
565 570 575 

TCG TCT TCT TCG GTG ACC ACT TCC GAG ACC CAA CCC TGC ACG CCC AGC 1776 
45 Ser Ser Ser Ser Val Thr Thr Ser Glu Thr Gin Pro Cys Thr Pro Ser 
580 585 590 

TCC AGT GAT TAG TCG GAC CTA CAG AGG GTG AAA CAG GAG CTT CTG GAA 1824 
Ser Ser Asp Tyr Ser Asp Leu Gin Arg Val Lys Gin Glu Leu Leu Glu 
50 595 600 605 

GAG GTG AAG AAG GAA TTG CAG AAA GTG AAA GAG GAA ATC ATT GAA GCC 1872 
Glu Val Lys Lys Glu Leu Gin Lys Val Lys Glu Glu lie lie Glu Ala 
610 615 620 



55 



TTC GTC CAG GAG CTG AGG AAG CGG GGT TCT CCC TGA 1908 

243 
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Phe Val Gin Glu Leu Arg Lys Arg Gly Ser Pro 
625 630 635 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

5° 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
^5 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 125 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ser Glu Thr Val He Met 

245 250 255 

Ser Glu Thr Val He Cys Ser Ser Arg Ala Thr Val Met Leu Tyr Asp 

260 265 270 

Asp Gly Asn Lys Arg Trp Leu Pro Ala Gly Thr Gly Pro Gin Ala Phe 

275 280 285 

Ser Arg Val Gin He Tyr His Asn Pro Thr Ala Asn Ser Phe Arg Val 

290 295 300 

val Gly Arg Lys Met Gin Pro Asp Gin Gin Val Val He Asn Cys Ala 
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J U D 










J XU 




















320 






vax 


Arg 


Gly 


vaj. 


Lys 


Tyr 


Asn 


Gxn 


AX a 


Thr 


Pro 


Asn 


Pne 


His 


Gin 












•a 1 c 










33 0 










335 






Trp 


Arg 


Asp 


Ala 


Arg 


Gin 


Val 


Trp 


Gly 


Leu 


Asn 


Phe Gly 


Ser 


Lys 


Glu 


O 








340 










345 










350 








Asp 


Ala 


Ala 


Gin 


Phe 


Ala 


Ala 


Gly 


Met 


Ala 


Ser 


Ala 


Leu 


Glu 


Ala 


Leu 








355 










360 










365 










Glu 


Gly 


Gly 


Gly 


Pro 


Pro 


Pro 


Pro 


Pro 


Ala 


Leu 


Pro 


Thr 


Trp 


Ser 


Val 






1 T A 










1 T C 










380 










lO 


Pro 


Asn 


Gly 


Pro 


Ser 


Pro 


Glu 


Glu 


Val 


Glu 


Gin 


Gin 


Lys 


Arg 


Gin 


Gin 




365 










3 90 










395 










400 


• 


Pro 


Gly 


Pro 


Ser 


Glu 


His 


He 


Glu 


Arg 


Arg 


val 


Ser 


Asn 


Ala 


Gly Gly 












4 05 










410 










415 






Pro 


Pro 


Ala 


Pro 


Pro 


Ala 


Gly 


Gly 


Pro 


Pro 


Pro 


Pro 


Pro 


Gly 


Pro 


Pro 


lO 








42 0 










425 










430 








Pro 


Pro 


Pro 


Gly 


Pro 


Pro 


Pro 


Pro 


Pro 


Gly 


Leu 


Pro 


Pro 


Ser 


Gly 


Val 








4 35 










44 0 










445 










Pro 


Ala 


Ala 


Ala 


His 


Gly 


Ala 


Gly 


Gly 


Gly 


Pro 


Pro 


Pro 


Ala 


Pro 


Pro 






450 










455 










460 










Oft 


Leu 


Pro 


Ala 


Ala 


Gin 


Gly 


Pro 


Gly 


Gly 


Gly 


Gly Ala Gly Ala 


Pro 


Gly 














470 










475 










480 




Leu 


Ala 


Ala 


Ala 


lie 


Ala 


Gly 


Ala 


Lys 


Leu 


Arg 


Lys 


Val 


Ser 


Lys 


Gin 












485 










490 










495 






Glu 


Glu 


Ala 


Ser 


Gly 


Gly 


Pro 


Thr 


Ala 


Pro 


Lys 


Ala 


Glu 


Ser Gly Arg 










500 










505 










510 








Ser 


Gly 


Gly 


Gly 


Gly 


Leu 


Met 


Glu 


Glu 


Met 


Asn 


Ala 


Met 


Leu 


Ala 


Arg 








515 










520 










525 










Arg 


Arg 


Lys 


Ala 


Thr 


Gin 


Val 


Gly 


Glu 


Lys 


Thr 


Pro 


Lys 


Asp 


Glu 


Ser 






530 










535 










540 










30 


Ala 


Asn 


Gin 


Glu 


Glu 


Pro 


Glu 


Ala 


Arg 


Val 


Pro 


Ala 


Gin 


Ser 


Glu 


Ser 




545 










550 










555 










560 




Val 


Arg 


Arg 


Pro 


Trp 


Glu 


Lys 


Asn 


Ser 


Thr 


Thr 


Leu 


Pro 


Arg 


Met 


Lys 












565 










570 










575 






Ser 


Ser 


Ser 


Ser 


Val 


Thr 


Thr 


Ser 


Glu 


Thr 


Gin 


Pro 


Cys 


Thr 


Pro 


Ser 


35 








580 










585 










590 








Ser 


Ser 


Asp 


Tyr 


Ser 


Asp 


Leu 


Gin 


Arg 


Val 


Lys 


Gin 


Glu 


Leu 


Leu 


Glu 








595 










600 










605 










Glu 


Val 


Lys 


Lys 


Glu 


Leu 


Gin 


Lys 


Val 


Lys 


Glu 


Glu 


He 


He 


Glu 


Ala 






610 










615 










620 










40 


Phe 


Val 


Gin 


Glu 


Leu 


Arg 


Lys 


Arg 


Gly 


Ser 


Pro 













625 630 635 



(2) INFORMATION FOR SEQ ID NO: 126: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
55 (B) LOCATION: 1...1326 

(D) OTHER INFORMATION: 

245 
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15 



20 



30 



35 



40 



45 



50 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
^ 5 10 



CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
" 7° 75 si 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Glv 
180 185 190 ^ 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 



48 



10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TG6 CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



144 



240 



25 mn h'''' T"" ir ^"^"^ ""^^ ''^^ ^TC CAG GAG 288 

Gin H:ls Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



336 



384 



432 



480 



528 



576 



624 



672 

246 
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Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Jdeu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAG GAG CTG TAG AAG TOO 720 
5 Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCA ATG GOT GCC ATC CGG AAG AAA 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ala Ala He Arg Lys Lys 
10 245 250 255 

CTG GTG ATT GTT GGT GAT GGA GCC TGT GGA AAG ACA TGC TTG CTC ATA 816 

Leu Val He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He 

260 265 270 

15 

GTC TTC AGC AAG GAC CAG TTC CCA GAG GTG TAT GTG CCC ACA GTG TTT 864 

Val Phe Ser Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe 

275 280 285 

20 GAG AAC TAT GTG GCA GAT ATC GAG GTG GAT GGA AAG CAG GTA GAG TTG 912 
Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val Glu Leu 
290 295 300 

GCT TTG TGG GAC ACA GCT GGG CAG GAA GAT TAT GAT CGC CTG AGG CCC 960 
25 Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro 
305 310 315 320 

CTC TCC TAC CCA GAT ACC GAT GTT ATA CTG ATG TGT TTT TCC ATC GAC 1008 
Leu Ser Tyr Pro Asp Thr Asp Val He Leu Met Cys Phe Ser He Asp 
30 325 330 335 

AGC CCT GAT AGT TTA GAA AAC ATC CCA GAA AAG TGG ACC CCA GAA GTC 1056 

Ser Pro Asp Ser Leu Glu Asn He Pro Glu Lys Trp Thr Pro Glu Val 

340 345 350 

35 

AAG CAT TTC TGT CCC AAC GTG CCC ATC ATC CTG GTT GGG AAT AAG AAG 1104 

Lys His Phe Cys Pro Asn Val Pro He He Leu Val Gly Asn Lys Lys 
355 360 365 

40 GAT CTT CGG AAT GAT GAG CAC ACA AGG CGG GAG CTA GCC AAG ATG AAG 1152 

Asp Leu Arg Asn Asp Glu His Thr Arg Arg Glu Leu Ala Lys Met Lys 
370 375 380 

CAG GAG CCG GTG AAA CCT GAA GAA GGC AGA GAT ATG GCA AAC AGG ATT 1200 
45 Gin Glu Pro Val Lys Pro Glu Glu Gly Arg Asp Met Ala Asn Arg He 
385 390 395 400 

GGC GCT TTT GGG TAC ATG GAG TGT TCA GCA AAG ACC AAA GAT GGA GTG 124 8 

Gly Ala Phe Gly Tyr Met Glu Cys Ser Ala Lys Thr Lys Asp Gly Val 
50 405 410 415 

AGA GAG GTT TTT GAA ATG GCT ACG AGA GCT GCT CTG CAA GCT AGA CGT 12 96 

Arg Glu Val Phe Glu Met Ala Thr Arg Ala Ala Leu Gin Ala Arg Arg 
420 425 430 



55 



GGG AAG AAA AAA TCT GGT TGC CTT GTC TTG TGA 132 9 
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Gly Lys Lys Lys Ser Gly Cys Leu Val Leu 
435 440 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ala Ala He Arg Lys Lys 

245 250 255 

Leu Val He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He 

2S0 265 270 

Val Phe Ser Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe 

275 280 285 

Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val Glu Leu 

290 295 300 

Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro 
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305 










310 










315 










320 




Leu 


Ser 


Tyr 


Pro 


Asp 
325 


Thr 


Asp 


Val 


lie 


Leu 
330 


Met 


Cys 


Phe 


Ser 


He 
335 


Asp 




Ser 


Pro 


Asp 


Ser 


Leu 


Glu 


Asn 


lie 


Pro 


Glu 


Lys 


Trp 


Thr 


Pro 


Glu 


Val 


c 
0 








34 0 










O yl C 










350 








Lys 


His 


Phe 


Cys 


Pro 


Asn 


Val 


Pro 


He 


He 


Leu 


val 


Gly Asn 


Lys 


Lys 








355 










360 










365 










Asp 


Leu 
370 


Arg 


Asn 


Asp 


Glu 


His 
375 


Thr 


Arg 


Arg 


Glu 


Leu 
380 


Ala 


Lys 


Met 


Lys 


10 


Gin 
385 


Glu 


Pro 


val 


Lys 


Pro 
390 


Glu 


Glu 


Gly 


Arg 


Asp 
395 


Met 


Ala 


Asn 


Arg 


He 
400 




Gly Ala 


Phe 


Gly 


Tyr 


Met 


Glu 


Cys 


Ser 


Ala 


Lys 


Thr 


Lys 


Asp 


Gly 


val 












405 










410 










415 






Arg 


GlU 


Val 


Phe 


Glu 


Met 


Ala 


Thr 


Arg 


Ala 


Ala 


Leu 


Gin 


Ala 


Arg 


Arg 


15 








420 










425 










430 








Gly 


Lys 


Lys 


Lys 


Ser 


Gly 


Cys 


Leu 


Val 


Leu 















435 440 



(2) INFORMATION FOR SEQ ID NO: 12 8: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(ix) FEATURE: 

30 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1137 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

35 

ATG GAC CAT TAT GAT TCT CAG 
Met Asp His Tyr Asp Ser Gin 
1 5 

40 GAG GAC TGG GAC CGG GAC CTG 
Glu Asp Trp Asp Arg Asp Leu 
20 

CAG AGA AAG ACA TTC ACG GCA 
45 Gin Arg Lys Thr Phe Thr Ala 
35 

GGG ACA CAG ATC GAG AAC ATC 
Gly Thr Gin He Glu Asn He 
50 50 55 

CTC ATG CTG CTG CTG GAG GTC 

Leu Met Leu Leu Leu Glu Val 
65 70 

55 

GAG CGA GGC AAG ATG AGA GTG 



CAA ACC AAC GAT TAC ATG CAG CCA GAA 4 8 

Gin Thr Asn Asp Tyr Met Gin Pro Glu 
10 15 

CTC CTG GAC CCG GCC TGG GAG AAG CAG 96 
Leu Leu Asp Pro Ala Trp Glu Lys Gin 
25 30 

TGG TGT AAC TCC CAC CTC CGG AAG GCG 144 

Trp Cys Asn Ser His Leu Arg Lys Ala 
40 45 

GAA GAG GAC TTC CGG GAT GGC CTG AAG 192 

Glu Glu Asp Phe Arg Asp Gly Leu Lys 
60 

ATC TCA GGT GAA CGC TTG GCC AAG CCA 240 

He Ser Gly Glu Arg Leu Ala Lys Pro 
75 80 

CAC AAG ATC TCC AAC GTC AAC AAG GCC 288 

249 
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Glu Arg Gly Lys Met Arg Val His Lys He Ser Asn Val Asn Lys Ala 
85 90 



95 



CTG GAT TTC ATA GCC AGC AAA GGC GTC AAA CTG GTG TCC ATC GGA 6CC 
Leu Asp Phe He Ala Ser Lys Gly Val Lys Leu Val Ser HI Sa 

105 

GAA GAA ATC GTG GAT GGG AAT GTC AAG ATG ACC CTO GGC ATG ATC TGG 
Glu Glu lie val ASP Gly Asn Val Lys Met Thr Leu Sy Met ile S 
lis 120 

Jhr ill Jlo r° f ° """^ =^ '^^^ ^TG GTG AGC AAG 

Thr lie He Leu Arg Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys 

135 

GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTC GAC 
Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu L^ ^p 
"° "5 16^ 

Glv A^n Sit T"" '^'^ GAG GGC 

Gly Asp val Asn Gly Has Lys Phe Ser Val Ser Gly Glu Gly Glu Gly 



165 170 175 



GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC 
Asp Ala Thr TVr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Jhr Tly 
180 185 190 

AAG CTG CCC GTG CCC TGG CCC ACC CTC GT<3 ACC ACC CTC ACC TAC GGC 
Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu thr ^r Sy 
195 200 



205 



GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC 
val Gin cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin nil Isl 
210 215 220 

TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC 
Phe Lys ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Se 

235 240 

WS Asn "^"^ "^^^ """^ *AG TTC GAG 

Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu 

245 250 255 

""^^ CTG AAG GGC ATC GAC TTC AAG 

Gly ASP Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp pJe JJs 
260 265 270 

G?v tf r° "^^^ ™C ^ AGC 

Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser 



280 



285 



CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG 
H.S Asn val Tyr He Met Ala Asp Lys Gin Lys Asn fly He vlt 

AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC 



336 



384 



432 



480 



52 8 



576 



624 



672 



720 



768 



816 



864 



912 



960 

250 
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Asn Phe I*ys lie Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala 

305 310 315 320 

GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAG GGC CCC GTG CTG CTG 1008 
5 Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu 

325 330 335 

CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC 1056 
Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro 
10 340 345 350 

AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC 1104 
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala 
355 360 365 



15 



20 



GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 114 0 

Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
370 375 



(2) INFORMATION FOR SEQ ID NO:129: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 379 amino acids 

25 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
30 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 





Met 


Asp 


His 


Tyr 


Asp 


Ser 


Gin 


Gin 


Thr 


Asn 


Asp 


Tyr 


Met 


Gin 


Pro 


Glu 


35 


1 








5 










10 










15 






Glu 


Asp 


Trp 


Asp 
20 


Arg 


Asp 


Leu 


Leu 


Leu 
25 


Asp 


Pro 


Ala 


Trp 


Glu 
30 


Lys 


Gin 




Gin 


Arg 


Lys 
35 


Thr 


Phe 


Thr 


Ala 


Trp 
40 


Cys 


Asn 


Ser 


His 


Leu 
45 


Arg 


Lys 


Ala 


40 


Gly 


Thr 
50 


Gin 


He 


Glu 


Asn 


He 
55 


Glu 


Glu 


Asp 


Phe 


Arg 
60 


Asp 


Gly 


Leu 


Lys 




Leu 


Met 


Leu 


Leu 


Leu 


Glu 


Val 


He 


Ser 


Gly 


Glu 


Arg 


Leu 


Ala 


Lys 


Pro 




65 










70 










75 










80 




Glu 


Arg 


Gly 


Lys 


Met 


Arg 


Val 


His 


Lys 


He 


Ser 


Asn 


val 


Asn 


Lys 


Ala 


45 










85 










90 










95 






Leu 


Asp 


Phe 


He 


Ala 


Ser 


Lys 


Gly 


Val 


Lys 


Leu 


Val 


Ser 


He 


Gly Ala 










100 










105 










110 








Glu 


Glu 


He 


Val 


Asp 


Gly 


Asn 


Val 


Lys 


Met 


Thr 


Leu 


Gly Met 


He 


Trp 








115 










120 










125 








50 


Thr 


He 
130 


He 


Leu 


Arg 


Arg 


Asp 
135 


Pro 


Pro 


Val 


Ala 


Thr 
140 


Met 


Val 


Ser 


Lys 




Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly 


Val 


Val 


Pro 


He 


Leu 


Val 


Glu 


Leu 


Asp 




145 










150 










155 










160 




Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 


Glu 


Gly 


Glu 


Gly 


55 










165 










170 










175 






Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


Cys 


Thr 


Thr 


Gly 
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180 185 



252 

190 



Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Glv 

3-95 200 205 

val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe 

215 220 

Phe Lys ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe 

235 240 

Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu 

Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Ihe Lys 

Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser 

2~5 280 285 

His Asn val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val 

295 

Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala 

315 

Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu III 

Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro 

•'^^ 345 
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala 

355 265 
Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lvs 
370 375 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...3513 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

ATO GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
^ 10 15 

vll liu f ^""^ "^^'^ TCC GGC 

val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Se He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 



48 



96 



144 



192 

252 
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Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAG GGC GTG GAG TGC TTC AGC CGC TAG CGG GAG GAG ATG AAG 240 
5 Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

GAG GAG GAG TTC TTC AAG TGC GGC ATG CGC GAA GGC TAG GTG CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
10 85 90 95 

CGC ACC ATG TTC TTC AAG GAG GAG GGC AAC TAG AAG ACC CGC GGC GAG 336 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

15 

GTG AAG TTC GAG GGC GAG ACC CTG GTG AAC CGC ATG GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

20 ATG GAG TTC AAG GAG GAC GGC AAC . ATG CTG GGG CAC AAG CTG GAG TAG 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 48 0 

25 Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
30 165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

35 

CCC GTG CTG CTG CCC GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

40 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TCC 720 
45 Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCC ATG AAC GCC CCC GAG CGG CAG CCC CAA CCC 768 
Gly Leu Arg Ser Arg Ala Met Asn Ala Pro Glu Arg Gin Pro Gin Pro 
50 245 250 255 

GAC GGC GGG GAC GCC CCA GGC CAC GAG CCT GGG GGC AGC CCC CAA GAC 816 
Asp Gly Gly Asp Ala Pro Gly His Glu Pro Gly Gly Ser Pro Gin Asp 
260 265 270 



55 



GAG CTT GAC TTC TCC ATC CTC TTC GAC TAT GAG TAT TTG AAT CGG AAC 864 
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470 475 



480 



912 



960 



254 

Glu Leu Asp Phe Ser He Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn 
275 280 285 

GAA GAA GAG CCG AAT GCA CAT AAG GTC GCC AGO CCA CCC TCC GGA CCC 
Glu Glu Glu Pro Asn Ala His Lys Val Ala Ser Pro Pro Ser Gly Pro 
290 295 

GCA TAG CCC GAT GAT GTA ATG GAG TAT GGC CTC AAG CCA TAC AGC CCC 
Ala Tyr Pro Asp Asp Val Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro 

310 315 320 

CTT GCT AGT CTC TCT GGC GAG CCC CCC GGC CGA TTC GGA GAG CCG GAT 1008 
Leu Ala Ser Leu Ser Gly Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp 
325 330 335 

AGG GTA GGG CCG CAG AAG TTT CTG AGC GCG GCC AAG CCA GCA GGG GCC 1056 
Arg val Gly Pro Gin Lys Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala 

345 350 

TCG GGC CTG AGC CCT CGG ATC GAG ATC ACT CCG TCC CAC GAA CTG ATC 1104 
Ser Gly Leu Ser Pro Arg He Glu He Thr Pro Ser His Glu Leu He 
355 360 365 

CAG GCA GTG GGG CCC CTC CGC ATG AGA GAC GCG GGC CTC CTG GTG GAG 1152 
Gin Ala Val Gly Pro Leu Arg Met Arg Asp Ala Gly Leu Leu Val Glu 
370 375 380 

CAG CCT CCC CTG GCC GGG GTG GCC GCC AGC CCG AGG TTC ACC CTO CCC 1200 
Gin Pro Pro Leu Ala Gly Val Ala Ala Ser Pro Arg Phe Thr Leu Pro 

390 395 400 

GTG CCC GGC TTC GAG GGC TAC CGC GAG CCG CTT TGC TTG AGC CCC GCT 1248 
val Pro Gly Phe Glu Gly Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala 

405 410 415 

AGC AGC GGC TCC TCT GCC AGC TTC ATT TCT GAC ACC TTC TCC CCC TAC 1296 
Ser Ser Gly Ser Ser Ala Ser Phe He Ser Asp Thr Phe Ser Pro Tyr 
420 425 430 

ACC TCG CCC TGC GTC TCG CCC AAT AAC GGC GGG CCC GAC GAC CTG TGT 1344 
Thr Ser Pro Cys Val Ser Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys 
435 440 445 

CCG CAG TTT CAA AAC ATC CCT GCT CAT TAT TCC CCC AGA ACC TCG CCA 1392 
Pro Gin Phe Gin Asn He Pro Ala His Tyr Ser Pro Arg Thr Ser Pro 
450 455 4gj, 

ATA ATG TCA CCT CGA ACC AGC CTC GCC GAG GAC AGC TGC CTG GGC CGC 1440 
He Met Ser Pro Arg Thr Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg 



CAC TCG CCC GTG CCC CGT CCG GCC TCC CGC TCC TCA TCG CCT GGT GCC 1488 
His Ser pro Val Pro Arg Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala 
485 490 495 

AAG CGG AGG CAT TCG TGC GCC GAG GCC TTG GTT GCC CTG CCG CCC GGA 1536 
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Lys Arg Arg His Ser Cys Ala Glu Ala Leu Val Ala Leu Pro Pro Gly 

500 505 510 

GCC TCA CCC CAG CGC TCC CGG AGC CCC TCG CCG CAG CCC TCA TCT CAC 1584 
5 Ala Ser Pro Gin Arg Ser Arg Ser Pro Ser Pro Gin Pro Ser Ser His 
515 520 525 

GTG GCA CCC CAG GAC CAC GGC TCC CCG GCT GGG TAC CCC CCT GTG GCT 1632 
Val Ala Pro Gin Asp His Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala 
10 530 535 540 

GGC TCT GCC GTG ATC ATG GAT GCC CTG AAC AGC CTC GCC ACG GAC TCG 1680 

Gly Ser Ala Val lie Met Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser 

545 550 555 560 

15 

CCT TGT GGG ATC CCC CCC AAG ATG TGG AAG ACC AGC CCT GAC CCC TCG 1728 

Pro Cys Gly lie Pro Pro Lys Met Trp Lys Thr Ser Pro Asp Pro Ser 
565 570 575 

20 CCG GTG TCT GCC GCC CCA TCC AAG GCC GGC CTG CCT CGC CAC ATC TAC 1776 

Pro Val Ser Ala Ala Pro Ser Lys Ala Gly Leu Pro Arg His lie Tyr 
580 585 590 

CCG GCC GTG GAG TTC CTG GGG CCC TGC GAG CAG GGC GAG AGG AGA AAC 1824 
25 Pro Ala Val Glu Phe Leu Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn 
595 600 605 

TCG GCT CCA GAA TCC ATC CTG CTG GTT CCG CCC ACT TGG CCC AAG CCG 1872 
Ser Ala Pro Glu Ser lie Leu Leu Val Pro Pro Thr Trp Pro Lys Pro 
30 610 615 620 

CTG GTG CCT GCC ATT CCC ATC TGC AGC ATC CCA GTG ACT GCA TCC CTC 1920 

Leu Val Pro Ala lie Pro lie Cys Ser lie Pro Val Thr Ala Ser Leu 

625 630 635 640 

35 

CCT CCA CTT GAG TGG CCG CTG TCC AGT CAG TCA GGC TCT TAC GAG CTG 1968 

Pro Pro Leu Glu Trp Pro Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu 
645 650 655 

40 CGG ATC GAG GTG CAG CCC AAG CCA CAT CAC CGG GCC CAC TAT GAG AC A 2016 

Arg lie Glu Val Gin Pro Lys Pro His His Arg Ala His Tyr Glu Thr 
660 665 670 

GAA GGC AGC CGA GGG GCT GTC AAA GCT CCA ACT GGA GGC CAC CCT GTG 2064 
45 Glu Gly Ser Arg Gly Ala Val Lys Ala Pro Thr Gly Gly His Pro Val 
675 680 685 

GTT CAG CTC CAT GGC TAC ATG GAA AAC AAG CCT CTG GGA CTT CAG ATC 2112 
Val Gin Leu His Gly Tyr Met Glu Asn Lys Pro Leu Gly Leu Gin lie 
50 690 695 700 

TTC ATT GGG ACA GCT GAT GAG CGG ATC CTT AAG CCG CAC GCC TTC TAC 2160 
Phe lie Gly Thr Ala Asp Glu Arg lie Leu Lys Pro His Ala Phe Tyr 
705 710 715 720 



55 



CAG GTG CAC CGA ATC ACG GGG AAA ACT GTC ACC ACC ACC AGC TAT GAG 2208 
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Gin Val His Arg lie Thr Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu 
725 730 

AAG ATA GTG GGC AAC ACC AAA GTC CTG GAG ATC CCC TTG GAG CCC AAA 2256 
Lys lie Val Gly Asn Thr Lys Val Leu Glu He Pro Leu Glu Pro Lys 
740 745 750 

AAC AAC ATG AGG GCA ACC ATC GAC TGT GCG GGG ATC TTG AAG CTT AGA 2304 
Asn Asn Met Arg Ala Thr He Asp Cys Ala Gly He Leu Lys Leu Arg 
755 760 765 

AAC GCC GAC ATT GAG CTG CGG AAA GGC GAG ACG GAC ATT GGA AGA AAG 2 352 
Asn Ala Asp He Glu Leu Arg Lys Gly Glu Thr Asp He Gly Arq Lys 
770 775 780 

AAC ACG CGG GTG AGA CTG GTT TTC CGA GTT CAC ATC CCA GAG TCC AGT 2400 
Asn Thr Arg Val Arg Leu Val Phe Arg Val His He Pro Glu Ser Ser 
'^^^ 790 795 800 

GGC AGA ATC GTC TCT TTA CAG ACT GCA TCT AAC CCC ATC GAG TGC TCC 2448 
Gly Arg He Val Ser Leu Gin Thr Ala Ser Asn Pro He Glu Cys Ser 
805 810 815 

CAG CGA TCT GCT CAC GAG CTG CCC ATG GTT GAA AGA CAA GAC ACA GAC 24 96 

Gin Arg Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Thr Asp 
820 825 830 

AGC TGC CTG GTC TAT GGC GGC CAG CAA ATG ATC CTC ACG GGG CAG AAC 2544 
Ser Cys Leu Val Tyr Gly Gly Gin Gin Met He Leu Thr Gly Gin Asn 
835 840 845 

TTT ACA TCC GAG TCC AAA GTT GTG TTT ACT GAG AAG ACC ACA GAT GGA 2592 
Phe Thr Ser Glu Ser Lys Val Val Phe Thr Glu Lys Thr Thr Asp Gly 
850 855 860 

CAG CAA ATT TGG GAG ATG GAA GCC ACG GTG GAT AAG GAC AAG AGC CAG 2 640 

Gin Gin He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin 

870 875 880 

CCC AAC ATG CTT TTT GTT GAG ATC CCT GAA TAT CGG AAC AAG CAT ATC 2688 
Pro Asn Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He 
885 890 895 

CGC ACA CCT GTA AAA GTG AAC TTC TAC GTC ATC AAT GGG AAG AGA AAA 2736 
Arg Thr Pro Val Lys Val Asn Phe Tyr Val He Asn Gly Lys Arg Lys 
900 905 910 

CGA AGT CAG CCT CAG CAC TTT ACC TAC CAC CCA GTC CCA GCC ATC AAG 2784 
Arg Ser Gin Pro Gin His Phe Thr Tyr His Pro Val Pro Ala He Lys 
915 920 925 

ACG GAG CCC ACG GAT GAA TAT GAC CCC ACT CTG ATC TGC AGC CCC ACC 2 832 

Thr Glu Pro Thr Asp Glu Tyr Asp Pro Thr Leu He Cys Ser Pro Thr 
530 935 940 

CAT GGA GGC CTG GGG AGC CAG CCT TAC TAC CCC CAG CAC CCG ATG GTG 2880 
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His Gly Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Val 

945 950 955 960 

GCC GAG TCC CCC TCC TGC CTC GTG GCC ACC ATG GCT CCC TGC CAG CAG 2928 
5 Ala Glu Ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin 

965 970 975 

TTC CGC ACG GGG CTC TCA TCC CCT GAC GCC CGC TAG CAG CAA CAG AAC 2 976 

Phe Arg Thr Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn 
10 980 985 990 

CCA GCG GCC GTA CTC TAC CAG CGG AGC AAG AGC CTG AGC CCC AGC CTG 3 024 

Pro Ala Ala Val Leu Tyr Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu 
995 1000 1005 

15 

CTG GGC TAT CAG CAG CCG GCC CTC ATG GCC GCC CCG CTG TCC CTT GCG 3 072 

Leu Gly Tyr Gin Gin Pro Ala Leu Met Ala Ala Pro Leu Ser Leu Ala 
1010 1015 1020 

20 GAC GCT CAC CGC TCT GTG CTG GTG CAC GCC GGC TCC CAG GGC CAG AGC 3120 
Asp Ala His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Ser 
1025 1030 1035 1040 

TCA GCC CTG CTC CAC CCC TCT CCG ACC AAC CAG CAG GCC TCG CCT GTG 3168 
25 Ser Ala Leu Leu His Pro Ser Pro Thr Asn Gin Gin Ala Ser Pro Val 

1045 1050 1055 

ATC CAC TAC TCA CCC ACC AAC CAG CAG CTG CGC TGC GGA AGC CAC CAG 3216 
lie His Tyr Ser Pro Thr Asn Gin Gin Leu 7U:g Cys Gly Ser His Gin 
30 1060 1065 1070 

GAG TTC CAG CAC ATC ATG TAC TGC GAG AAT TTC GCA CCA GGC ACC ACC 3 2 64 

Glu Phe Gin His lie Met Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr 
1075 1080 1085 

35 

AGA CCT GGC CCG CCC CCG . GTC AGT CAA GGT CAG AGG CTG AGC CCG GGT 3 312 

Arg Pro Gly Pro Pro Pro Val Ser Gin Gly Gin Arg Leu Ser Pro Gly 
1090 1095 1100 

40 TCC TAC CCC ACA GTC ATT CAG CAG CAG AAT GCC ACG AGC CAA AGA GCC 3360 

Ser Tyr Pro Thr Val lie Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala 
1105 1110 1115 1120 

GCC AAA AAC GGA CCC CCG GTC AGT GAC CAA AAG GAA GTA TTA CCT GCG 34 08 

45 Ala Lys Asn Gly Pro Pro Val Ser Asp Gin Lys Glu Val Leu Pro Ala 

1125 1130 1135 

GGG GTG ACC ATT AAA CAG GAG CAG AAC TTG GAC CAG ACC TAC TTG GAT 3456 
Gly Val Thr lie Lys Gin Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp 
50 1140 1145 1150 

GAT GTT AAT GAA ATT ATC AGG AAG GAG TTT TCA GGA CCT CCT GCC AGA 3 5 04 

Asp Val Asn Glu lie lie Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg 
1155 1160 1165 
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AAT CAG ACG TAA 3516 
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258 

Asn Gin Thr 
1170 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE; internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
25 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

30 Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
35 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

40 val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
45 210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Met Asn Ala Pro Glu Arg Gin Pro Gin Pro 

245 250 255 

Asp Gly Gly Asp Ala Pro Gly His Glu Pro Gly Gly Ser Pro Gin Asp 

260 265 270 

Glu Leu Asp Phe Ser He Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn 

275 280 285 

Glu Glu Glu Pro Asn Ala His Lys Val Ala Ser Pro Pro Ser Gly Pro 
55 290 295 300 

Ala Tyr Pro Asp Asp Val Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro 
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305 310 315 320 

Leu Ala Ser Leu Ser Gly Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp 

325 330 335 

Arg Val Gly Pro Gin Lys Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala 
5 340 345 350 

Ser Gly Leu Ser Pro Arg lie Glu lie Thr Pro Ser His Glu Leu lie 

355 360 365 

Gin Ala Val Gly Pro Leu Arg Met Arg Asp Ala Gly Leu Leu Val Glu 
370 375 380 

10 Gin Pro Pro Leu Ala Gly Val Ala Ala Ser Pro Arg Phe Thr Leu Pro 
385 390 395 400 

Val Pro Gly Phe Glu Gly Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala 

405 410 415 

Ser Ser Gly Ser Ser Ala Ser Phe lie Ser Asp Thr Phe Ser Pro Tyr 
15 420 425 430 

Thr Ser Pro Cys Val Ser Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys 

435 440 445 

Pro Gin Phe Gin Asn lie Pro Ala His Tyr Ser Pro Arg Thr Ser Pro 
450 455 460 

20 lie Met Ser Pro Arg Thr Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg 
465 470 475 480 

His Ser Pro Val Pro Arg Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala 

485 490 495 

Lys Arg Arg His Ser Cys Ala Glu Ala Leu Val Ala Leu Pro Pro Gly 
25 500 505 510 

Ala Ser Pro Gin Arg Ser Arg Ser Pro Ser Pro Gin Pro Ser Ser His 

515 520 525 

Val Ala Pro Gin Asp His Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala 
530 535 540 

30 Gly Ser Ala Val lie Met Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser 
545 550 555 560 

Pro Cys Gly lie Pro Pro Lys Met Trp Lys Thr Ser Pro Asp Pro Ser 

565 570 575 

Pro Val Ser Ala Ala Pro Ser Lys Ala Gly Leu Pro Arg His lie Tyr 
35 580 585 590 

Pro Ala Val Glu Phe Leu Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn 

595 600 605 

Ser Ala Pro Glu Ser lie Leu Leu Val Pro Pro Thr Trp Pro Lys Pro 
610 615 620 

40 Leu Val Pro Ala lie Pro lie Cys Ser lie Pro Val Thr Ala Ser Leu 
625 630 635 640 

Pro Pro Leu Glu Trp Pro Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu 

645 650 655 

Arg lie Glu Val Gin Pro Lys Pro His His Arg Ala His Tyr Glu Thr 
45 660 665 670 

Glu Gly Ser Arg Gly Ala Val Lys Ala Pro Thr Gly Gly His Pro Val 

675 680 685 

Val Gin Leu His Gly Tyr Met Glu Asn Lys Pro Leu Gly Leu Gin lie 
690 695 700 

50 Phe lie Gly Thr Ala Asp Glu Arg lie Leu Lys Pro His Ala Phe Tyr 
705 710 715 720 

Gin Val His Arg lie Thr Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu 

725 730 735 

Lys lie Val Gly Asn Thr Lys Val Leu Glu lie Pro Leu Glu Pro Lys 
55 740 745 750 

Asn Asn Met Arg Ala Thr lie Asp Cys Ala Gly lie Leu Lys Leu Arg 
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755 760 765 

Asn Ala Asp He Glu Leu Arg Lys Gly Glu Thr Asp He Gly Arg Lys 

770 775 
Asn Thr Arg Val Arg Leu Val Phe Arg Val His He Pro Glu Ser Ser 

°^ 790 795 

Gly Arg He Val Ser Leu Gin Thr Ala Ser Asn Pro He Glu Cys Ser 

805 810 815 

Gin Arg Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Thr Asp 

c r. "° 830 

ser Cys Leu Val Tyr Gly Gly Gin Gin Met He Leu Thr Gly Gin Asn 

835 840 845 

Phe Thr Ser Glu Ser Lys Val Val Phe Thr Glu Lys Thr Thr Asp Gly 

850 855 860 

Gin Gin He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin 

875 880 
Pro Asn Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He 

885 890 895 

Arg Thr Pro Val Lys Val Asn Phe Tyr Val He Asn Gly Lys Arg Lys 

900 905 
Arg ser Gin Pro Gin His Phe Thr Tyr His Pro Val Pro Ala He Lys 

9" 920 925 

Thr Glu Pro Thr Asp Glu Tyr Asp Pro Thr Leu He Cys Ser Pro Thr 

"0 935 940 

His Gly Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Val 
r 955 960 

Ala Glu ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin 

9S5 970 
Phe Arg Thr Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn 

980 985 990 

Pro Ala Ala Val Leu Tyr Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu 

1000 1005 
Leu Gly Tyr Gin Gin Pro Ala Leu Met Ala Ala Pro Leu Ser Leu Ala 

1010 1015 1020 

Asp Ala His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Ser 
1030 1035 1040 

Ser Ala Leu Leu His Pro Ser Pro Thr Asn Gin Gin Ala Ser Pro Val 

. 1045 1050 1055 

He His Tyr Ser Pro Thr Asn Gin Gin Leu Arg Cys Gly Ser His Gin 

1060 1065 1070 

Glu Phe Gin His He Met Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr 

1075 1080 1085 

Arg Pro Gly Pro Pro Pro Val Ser Gin Gly Gin Arg Leu Ser Pro Glv 

1090 1095 1100 

Ser Tyr Pro Thr Val He Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala 
1°^ , 1110 1115 1120 

Ala Lys Asn Gly Pro Pro Val Ser Asp Gin Lys Glu Val Leu Pro Ala 

1125 1130 1135 

Gly Val Thr He Lys Gin Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp 

1140 1145 1150 

Asp Val Asn Glu He He Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg 

1155 1160 1165 

Asn Gin Thr 
1170 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

10 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...3543 
(D) OTHER INFORMATION: 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 2: 

ATG AAC GCC CCC GAG CGG CAG CCC CAA CCC GAC GGC GGG GAC GCC CCA 48 
Met Asn Ala Pro Glu Arg Gin Pro Gin Pro Asp Gly Gly Asp Ala Pro 
15 10 15 

20 GGC CAC GAG CCT GGG GGC AGC CCC CAA GAC GAG CTT GAC TTC TCC ATC 96 
Gly His Glu Pro Gly Gly Ser Pro Gin Asp Glu Leu Asp Phe Ser lie 
20 25 30 

CTC TTC GAC TAT GAG TAT TTG AAT COG AAC GAA GAA GAG CCG AAT GCA 144 
25 Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn Glu Glu Glu Pro Asn Ala 
35 40 45 

CAT AAG GTC GCC AGC CCA CCC TCC GGA CCC GCA TAC CCC GAT GAT OTA 192 
His Lys Val Ala Ser Pro Pro Ser Gly Pro Ala Tyr Pro Asp Asp Val 
30 50 55 60 

ATG GAC TAT GGC CTC AAG CCA TAC AGC CCC CTT GCT AGT CTC TCT GGC 24 0 

Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro Leu Ala Ser Leu Ser Gly 
65 70 75 80 

35 

GAG CCC CCC GGC CGA TTC GGA GAG CCG GAT AGG GTA GGG CCG CAG AAG 2 88 

Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp Arg Val Gly Pro Gin Lys 
85 90 95 

40 TTT CTG AGC GCG GCC AAG CCA GCA GGG GCC TCG GGC CTG AGC CCT CGG 336 

Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala Ser Gly Leu Ser Pro Arg 
100 105 110 

ATC GAG ATC ACT CCG TCC CAC GAA CTG ATC CAG GCA GTG GGG CCC CTC 384 
45 lie Glu lie Thr Pro Ser His Glu Leu lie Gin Ala Val Gly Pro Leu 
115 120 125 

CGC ATG AGA GAC GCG GGC CTC CTG GTG GAG CAG CCT CCC CTG GCC GGG 432 
Arg Met Arg Asp Ala Gly Leu Leu Val Glu Gin Pro Pro Leu Ala Gly 
50 130 135 140 

GTG GCC GCC AGC CCG AGG TTC ACC CTG CCC GTG CCC GGC TTC GAG GGC 4 80 

Val Ala Ala Ser Pro Arg Phe Thr Leu Pro Val Pro Gly Phe Glu Gly 
145 150 155 160 



55 



TAC CGC GAG CCG CTT TGC TTG AGC CCC GCT AGC AGC GGC TCC TCT GCC 528 
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Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala Ser Ser Gly Ser Ser Ala 
165 175 

AGC TTC ATT TCT GAC ACC TTC TCC CCC TAG ACC TCG CCC TGC GTC TOG 576 
ser Phe lie Ser Asp Thr Phe Ser Pro Tyr Thr Ser Pro Cys Val Ser 

185 190 

CCC AAT AAC GGC GGG CCC GAC GAC CTG TGT CCG CAG TTT CAA AAC ATC 
Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys Pro Gin Phe Gin Asn He 

200 205 

CCT GCT CAT TAT TCC CCC AGA ACC TCG CCA ATA ATG TCA CCT CGA ACC 672 
Pro Ala His Tyr Ser Pro Arg Thr Ser Pro He Met Ser Pro Arg Thr 

215 220 



CTG CTG GTT CCG CCC ACT TGG CCC AAG CCG CTG GTG CCT GCC ATT CCC 
Leu Leu Val Pro Pro Thr Trp Pro Lys Pro Leu Val Pro Ala He Pro 

375 380 

ATC TGC AGC ATC CCA GTG ACT GCA TCC CTC CCT CCA CTT GAG TGG CCG 



624 



AGC CTC GCC GAG GAC AGC TGC CTG GGC CGC CAC TCG CCC GTG CCC CGT 
Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg His Ser Pro Val Pro Arq 
230 235 240 

CCG GCC TCC CGC TCC TCA TCG CCT GGT GCC AAG CGG AGG CAT TCG TGC 
Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala Lys Arg Arg His Ser Cys 
245 250 255 

GCC GAG GCC TTG GTT GCC CTG CCG CCC GGA GCC TCA CCC CAG CGC TCC 
Ala Glu Ala Leu Val Ala Leu Pro Pro Gly Ala Ser Pro Gin Arg Ser 
260 265 270 

CGG AGC CCC TCG CCG CAG CCC TCA TCT CAC GTG GCA CCC CAG GAC CAC 
Arg Ser Pro Ser Pro Gin Pro Ser Ser His Val Ala Pro Gin Asp His 
275 280 285 

GGC TCC CCG GCT GGG TAC CCC CCT GTG GCT GGC TCT GCC GTG ATC ATG 
Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala Gly Ser Ala Val He Met 
290 295 300 

GAT GCC CTG AAC AGC CTC GCC ACG GAC TCG CCT TGT GGG ATC CCC CCC 
Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser Pro Cys Gly He Pro Pro 

310 315 320 

AAG ATG TGG AAG ACC AGC CCT GAC CCC TCG CCG GTG TCT GCC GCC CCA 
Lys Met Trp Lys Thr Ser Pro Asp Pro Ser Pro Val Ser Ala Ala Pro 
325 330 

TCC AAG GCC GGC CTG CCT CGC CAC ATC TAC CCG GCC GTG GAG TTC CTG 
Ser Lys Ala Gly Leu Pro Arg His He Tyr Pro Ala Val Glu Phe Leu 
340 345 350 

GGG CCC TGC GAG CAG GGC GAG AGG AGA AAC TCG GCT CCA GAA TCC ATC 1104 
Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn Ser Ala Pro Glu Ser He 
355 360 365 



720 



768 



816 



864 



912 



960 



1008 



1056 



1152 



1200 

262 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 PCT/DK98/00145 

263 

lie Cys Ser lie Pro Val Thr Ala Ser Leu Pro Pro Leu Glu Trp Pro 
385 390 395 400 

CTG TCC AGT CAG TCA GGC TCT TAG GAG CTG CGG ATC GAG GTG CAG CCC 1248 
5 Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu Arg lie Glu Val Gin Pro 

405 410 415 

AAG CCA CAT CAC CGG GCC CAC TAT GAG ACA GAA GGC AGC CGA GGG GCT 1296 
Lys Pro His His Arg Ala His Tyr Glu Thr Glu Gly Ser Arg Gly Ala 
10 420 425 430 

GTC AAA GCT CCA ACT GGA GGC CAC CCT GTG GTT CAG CTC CAT GGC TAC 1344 

Val Lys Ala Pro Thr Gly Gly His Pro Val Val Gin Leu His Gly Tyr 
435 440 445 

15 

ATG GAA AAC AAG CCT CTG GGA CTT CAG ATC TTC ATT GGG ACA GCT GAT 13 92 

Met Glu Asn Lys Pro Leu Gly Leu Gin lie Phe lie Gly Thr Ala Asp 
450 455 460 

20 GAG CGG ATC CTT AAG CCG CAC GCC TTC TAC CAG GTG CAC CGA ATC ACG 1440 

Glu Arg He Leu Lys Pro His Ala Phe Tyr Gin Val His Arg He Thr 
465 470 475 480 

GGG AAA ACT GTC ACC ACC ACC AGC TAT GAG AAG ATA GTG GGC AAC ACC 14 88 

25 Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu Lys He Val Gly Asn Thr 

485 490 495 

AAA GTC CTG GAG ATC CCC TTG GAG CCC AAA AAC AAC ATG AGG GCA ACC 1536 
Lys Val Leu Glu He Pro Leu Glu Pro Lys Asn Asn Met Arg Ala Thr 
30 500 505 510 

ATC GAC TGT GCG GGG ATC TTG AAG CTT AGA AAC GCC GAC ATT GAG CTG 1584 

He Asp Cys Ala Gly He Leu Lys Leu Arg Asn Ala Asp He Glu Leu 
515 520 525 

35 

CGG AAA GGC GAG ACG GAC ATT GGA AGA AAG AAC ACG CGG GTG AGA CTG 1632 

Arg Lys Gly Glu Thr Asp He Gly Arg Lys Asn Thr Arg Val Arg Leu 
530 535 540 

40 GTT TTC CGA GTT CAC ATC CCA GAG TCC AGT GGC AGA ATC GTC TCT TTA 1680 
Val Phe Arg Val His He Pro Glu Ser Ser Gly Arg He Val Ser Leu 
545 550 555 560 

CAG ACT GCA TCT AAC CCC ATC GAG TGC TCC CAG CGA TCT GCT CAC GAG 1728 
45 Gin Thr Ala Ser Asn Pro He Glu Cys Ser Gin Arg Ser Ala His Glu 

565 570 575 

CTG CCC ATG GTT GAA AGA CAA GAC ACA GAC AGC TGC CTG GTC TAT GGC 1776 
Leu Pro Met Val Glu Arg Gin Asp Thr Asp Ser Cys Leu Val Tyr Gly 
50 580 585 590 

GGC CAG CAA ATG ATC CTC ACG GGG CAG AAC TTT ACA TCC GAG TCC AAA 1824 
Gly Gin Gin Met He Leu Thr Gly Gin Asn Phe Thr Ser Glu Ser Lys 
595 600 605 



55 



GTT GTG TTT ACT GAG AAG ACC ACA GAT GGA CAG CAA ATT TGG GAG ATG 1872 
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Val val Phe Thr Glu Lys Thr Thr Asp Gly Gin Gin He Trp Glu Met 

S15 



620 



GAA GCC ACG GTG GAT AAG GAG AAG AGO CAG CCC AAC ATG CTT TTT GTT 
Glu Ala Thr val Asp Lys Asp Lys Ser Gin Pro Asn Met Leu PlJI SI 



"° "5 640 

ill rll '^'^ '^'^ '^^^ CCT OTA AAA GTG 

Glu He Pro Glu Tyr Arg Asn Lys His He Arg Thr Pro Val Lys Val 
645 

AAC TTC TAG GTC ATC AAT GGG AAG AGA AAA CGA AGT CAG CCT CAG CAC 
Asn Phe Tyr Val He Asn Gly Lys Arg Lys Arg Ser Gin Pro Sn Ss 

665 

I'''' '^'^ ACG GAG CCC ACG GAT GAA 

Phe Thr Tyr His Pro Val Pro Ala He Lys Thr Glu Pro Thr Asp ^ 
675 680 685 

TAT GAC CCC ACT CTG ATC TGC AGC CCC ACC CAT GGA GGC CTG GGG AGC 
Tyr ASP Pro Thr Leu He Cys Ser Pro Thr His Gly Gly Leu Gly Jer 
690 695 700 

''''^ """^ ^""^ "^^^ °™ TCC CCC TCC TGC 

Gin Pro Tyr Tyr Pro Gin His Pro Met Val Ala Glu Ser Pro Ser Cys 



715 



720 



T V ? ''u'' -^"^ TTC CGC ACG GGG CTC TCA 

Leu val Ala Thr Met Ala Pro Cys Gin Gin Phe Arg Thr Gly Leu Se^ 
725 730 „5 

TCC CCT GAC GCC CGC TAC CAG CAA CAG AAC CCA GCG GCC GTA CTC TAC 
ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn Pro Ala Ala Val Su ^ 
740 745 



CAG CGG AGC AAG AGC CTG AGC CCC AGC CTG CTG GGC TAT CAG CAG CCG 
Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu Leu Gly Tyr Gin Gin Pro 
755 760 765 

Sa Leu SI^ l^"" f ° '''^ '^^'^ CAC CGC TOT GTG 

Ala Leu Met Ala Ala Pro Leu Ser Leu Ala Asp Ala His Arg Ser Val 

775 780 

CTG GTG CAC GCC GGC TCC CAG GGC CAG AGC TCA GCC CTG CTC CAC CCC 
^eu val His Ala Gly Ser Gin Gly Gin Ser Ser Ala Leu Leu Ss Pro 

795 800 

TCT CCG ACC AAC CAG CAG GCC TCG CCT GTG ATC CAC TAC TCA CCC ACP 
ser Pro Thr Asn Gin Gin Ala Ser Pro Val He His ^r Pro Jhr 

805 810 815 

Sn S^n S° f ^^'^ "^^^ CAG CAC ATC ATG 

Asn Gin Gin Leu Arg Cys Gly Ser His Gin Glu Phe Gin His He Met 

82 0 



825 



830 



TAC TGC GAG AAT TTC GCA CCA GGC ACC ACC AGA CCT GGC CCG CCC CCG 
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1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 
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Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr Arg Pro Gly Pro Pro Pro 
835 840 845 

GTC AGT CAA GGT CAG AGG CTG AGC CCG GGT TCC TAG CCC ACA GTC ATT 2592 
5 Val Ser Gin Gly Gin Arg Leu Ser Pro Gly Ser Tyr Pro Thr Val lie 
850 855 860 

CAG CAG CAG AAT GCC ACG AGC CAA AGA GCC GCC AAA AAC GGA CCC CCG 2640 
Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala Ala Lys Asn Gly Pro Pro 
10 865 870 875 880 

GTC AGT GAC CAA AAG GAA GTA TTA CCT GCG GGG GTG ACC ATT AAA CAG 2688 
Val Ser Asp Gin Lys Glu Val Leu Pro Ala Gly Val Thr lie Lys Gin 
885 890 895 

15 

GAG CAG AAC TTG GAC CAG ACC TAC TTG GAT GAT GTT AAT GAA ATT ATC 2736 
Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp Asp Val Asn Glu lie lie 
900 905 910 

20 AGG AAG GAG TTT TCA GGA CCT CCT GCC AGA AAT CAG ACG AGA ATT CTG 2784 
Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg Asn Gin Thr Arg lie Leu 
915 920 925 

CAG TCG ACG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG GTG 2832 
25 Gin Ser Thr Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val 
930 935 940 

AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG 2880 
Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu 
30 945 950 955 960 

CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC 2928 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

965 970 975 

35 

GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC 2976 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr 
980 985 990 

40 ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC 3024 
Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
995 1000 1005 

TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC 3072 
45 Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
1010 1015 1020 

GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC 3120 
Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
50 1025 1030 1035 1040 

ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG 3168 
lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
1045 1050 1055 



55 



TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC 3216 
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Phe Glu Gly ASP Thr Leu Val Asn Arg He Glu Leu Lys aly He Asp 
1060 1065 1070 

TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC 
Phe Lys Glu Asp Gly Asa He Leu Gly His Lys Leu Su ^ JJ" 
10" 1080 1085 

AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC 
Asn^Ser His Asn Val Tyr He Met Ala Asp Lys Gin ^s Gly H^ 

1100 

vll f AGC GTG CAG 
^Lys val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Sn 
mo 1115 ^^^^ 

CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG 
i.eu Ala ASP His Tvr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
"25 1130 ^^35 

CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Su Ser 
"40 1145 ^^^^ 

GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC 
ASP Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu III Tl rlr 

1160 1165 

111 111 Glv if '^'^ ™<= 

^ ^^"^ Leu Tyr Lys 

1"5 1180 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDHESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
Met Asn Ala Pro Glu Arg Gin Pro Gin Pro Asp Gly Gly Asp Ala Pro 
Gly His Glu Pro Gly Gly Ser Pro Gin ^sp Glu Leu Asp Phe Ser He 
Leu Phe ASP Tyr Glu Tyr Leu Asn Pro Asn Glu Glu Glu Pro Asn Ala 
His Lys val Ala Ser Pro Pro Ser Gly Pro Ala Tyr tlo Asp Asp Val 
Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro Leu IL ser Leu Ser Gly 
Glu pro pro Gly Arg Phe Gly Glu Pro Asp Irg Val Gly Pro Gin L^s 



3264 



3312 



3360 



3408 



3456 



3504 



3546 
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85 



90 



95 



10 



16 



20 



25 



30 



35 



40 



45 



50 



Phe Leu Ser Ala Ala hys Pro Ala Gly Ala Ser Gly Leu Ser Pro Arg 

100 105 110 

lie Glu lie Thr Pro Ser His Glu Leu lie Gin Ala Val Gly Pro Leu 

115 120 125 

Arg Met Arg Asp Ala Gly Leu Leu Val Glu Gin Pro Pro Leu Ala Gly 

130 135 140 

Val Ala Ala Ser Pro Arg Phe Thr Leu Pro Val Pro Gly Phe Glu Gly 
145 150 155 160 

Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala Ser Ser Gly Ser Ser Ala 

165 170 175 

Ser Phe lie Ser Asp Thr Phe Ser Pro Tyr Thr Ser Pro Cys Val Ser 

180 185 190 

Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys Pro Gin Phe Gin Asn He 

195 200 205 

Pro Ala His Tyr Ser Pro Arg Thr Ser Pro He Met Ser Pro Arg Thr 

210 215 220 

Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg His Ser Pro Val Pro Arg 
225 230 235 240 

Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala Lys Arg Arg His Ser Cys 

245 250 255 

Ala Glu Ala Leu Val Ala Leu Pro Pro Gly Ala Ser Pro Gin Arg Ser 

260 265 270 

Arg Ser Pro Ser Pro Gin Pro Ser Ser His Val Ala Pro Gin Asp His 

275 280 285 

Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala Gly Ser Ala Val He Met 

290 295 300 

Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser Pro Cys Gly He Pro Pro 
305 310 315 320 

Lys Met Trp Lys Thr Ser Pro Asp Pro Ser Pro Val Ser Ala Ala Pro 

325 330 335 

Ser Lys Ala Gly Leu Pro Arg His He Tyr Pro Ala Val Glu Phe Leu 

340 345 350 

Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn Ser Ala Pro Glu Ser He 

355 360 365 

Leu Leu Val Pro Pro Thr Trp Pro Lys Pro Leu Val Pro Ala He Pro 

370 375 380 

He Cys Ser He Pro Val Thr Ala Ser Leu Pro Pro Leu Glu Trp Pro 
385 390 395 400 

Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu Arg He Glu Val Gin Pro 

405 410 415 

Lys Pro His His Arg Ala His Tyr Glu Thr Glu Gly Ser Arg Gly Ala 

420 425 430 

Val Lys Ala Pro Thr Gly Gly His Pro Val Val Gin Leu His Gly Tyr 

435 440 445 

Met Glu Asn Lys Pro Leu Gly Leu Gin He Phe He Gly Thr Ala Asp 

450 455 460 

Glu Arg He Leu Lys Pro His Ala Phe Tyr Gin Val His Arg He Thr 
465 470 475 480 

Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu Lys He Val Gly Asn Thr 

485 490 495 

Lys Val Leu Glu He Pro Leu Glu Pro Lys Asn Asn Met Arg Ala Thr 

500 505 510 

He Asp Cys Ala Gly He Leu Lys Leu Arg Asn Ala Asp He Glu Leu 

515 520 525 

Arg Lys Gly Glu Thr Asp He Gly Arg Lys Asn Thr Arg Val Arg Leu 
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Val Arg Val His lie Pro Glu Ser Ser Gly xie Val Ser Leu 

Gin Thr Ala Ser Asn Pro lie Glu Cys Ser III Arg Ser Ala His llu 



570 



Leu Pro Met Val Glu Arg Gin Asp Thr Asp Ser Cys Leu Val ^r Gly 

Gly Gin Gin Met He Leu Thr Gly III Asn Phe Thr Ser III Ser Lys 

val val Phe Thr Glu Lys Thr Thr Asp Gly Gin Gin III Trp Glu Met 

Glu Ala Thr Val Asp Lys L'p Lys Ser Gin Pro 111 Met Leu Phe Val 

Glu He Pro Glu Tyr Arg Asn Lys His He Irg Thr Pro Val Lys til 

^ 650 ficc 

Asn Phe Tyr Val He Asn Gly Lys Arg Lys Arg Ser Gin Pro Gin His 

665 fi7n 
Phe Thr Tyr His Pro Val Pro Ala He Lys Thr Glu Pro ^hr Asp Glu 

660 685 
Tyr ASP Pro Thr Leu He Cys Ser Pro Thr His Gly Gly Leu Gly Ser 

Gin Pro Tyr Tyr Pro Gin "s Pro Met Val Ala III Ser Pro Ser Cys 

715 

Leu val Ala Thr Met Ala Pro Cys Gin Gin Phe Arg Thr Gly Leu Ser 

ser Pro Asp Ala Arg Tyr Gin Gin Gin III Pro Ala Ala Val l" Tyr 

745 -Tcn 
Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu Leu Gly Tyr Gin Gin Pro 



760 



Ala Leu Met Ala Ala Pro Leu s;; Leu Ala Asp Ala 111 Arg Ser Val 



770 



' ' **■ 78 0 

Leu val His Ala Gly Ser Gin Gly Gin Ser Ser Ala Lau L.u Hi. Pro 



790 



ser Pro Thr Asn Gin Gin Ala Ser Pro Val Il'e His Tyr Ser Pro Jhr 

810 aiq 

820 """^ ""^^ ""'^ J^'s ^""^ 

ryr cys Glu Asn Phe Ala Pro Gly ^hr Thr Arg Pro Gly Pro Pro Pro 

840 

val ser Gin Gly Gin Arg Leu Ser Pro Gly Ser Tyr Pro Thr Val He 

855 860 
Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala Ala Lys Asn Gly Pro Pro 

val ser Asp Gin Lys Glu Val Leu Pro Ala Gly Val Thr He Lys "n 

Glu Gin Asn Leu Asp Gin Thr Tyr Leu "p Asp Val Asn Glu He He 

SOS 

Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg Asn Gin Thr A^g He Leu 

Gin ser Thr Val Pro Arg Ala "g Asp Pro Pro Val III Thr Met Val 

ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val III He Leu Val Glu 

950 955 
Leu ASP Gly ASP val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gl'y 

970 

Glu Gly ASP Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 
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980 985 990 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 

995 1000 1005 . 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
5 1010 1015 1020 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
025 1030 1035 1040 

He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
1045 1050 1055 

10 Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 
1060 1065 1070 

Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 

1075 1080 1085 

Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 
15 1090 1095 1100 

Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 
105 1110 1115 1120 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 
1125 1130 1135 

20 Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
1140 1145 1150 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 

1155 1160 1165 

Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
25 1170 1175 1180 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2799 
40 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

45 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
50 .20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



55 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
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Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu VaX Thr Thr 
=° 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp Zt 
" '° 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 35 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
10° 105 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTC AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 3^3^, 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTC 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



240 



288 



336 



432 



AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
25 Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

150 155 



528 



576 



624 



672 



45 vTl Th^ ^T"" '''''' ^''^ TAC AAG TCC 720 

45 val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

230 235 

GGA CTC AGA TCT CGA GGG AGC ATG GGC ACC TTG CGG GAT TTA CAG TAC 
Gly Leu Arg Ser Arg Gly ser Met Gly Thr Leu Arg Asp Leu Gin Tyr 
245 250 255 

GCG CTC CAG GAG AAG ATC GAG GAG CTG AGG CAG CGG GAT GCT CTC ATC 
Ala Leu Gin Glu Lys He Glu Glu Leu Arg Gin Arg Asp Ala Leu He 
260 265 270 

GAC GAG CTG GAG CTG GAG TTG GAT CAG AAG GAC GAA CTG ATC CAG AAG 



768 



616 



864 

270 
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Asp Glu Leu Glu Leu Glu Leu Asp Gin Lys Asp Glu Leu lie Gin Lys 
275 280 285 

CTG CAG AAC GAG CTG GAG AAG TAG CGC TCG GTG ATC CGA CCA GCC ACC 912 
5 Leu Gin Asn Glu Leu Asp Lys Tyr Arg Ser Val lie Arg Pro Ala Thr 
290 295 300 

CAG CAG GCG CAG AAG CAG AGC GCG AGC ACC TTG CAG GGC GAG CC6 CGC 960 
Gin Gin Ala Gin Lys Gin Ser Ala Ser Thr Leu Gin Gly Glu Pro Arg 
10 305 310 315 320 

ACC AAG CGG CAG GCG ATC TCC GCC GAG CCC ACC GCC TTC GAC ATC CAG 1008 
Thr Lys Arg Gin Ala lie Ser Ala Glu Pro Thr Ala Phe Asp lie Gin 
32S 330 335 

15 

GAT CTC AGC CAT GTG ACC CTG CCC TTC TAC CCC AAG AGC CCA CAG TCC 1056 
Asp Leu Ser His Val Thr Leu Pro Phe Tyr Pro Lys Ser Pro Gin Ser 
340 345 350 

20 AAG GAT CTT ATA AAG GAA GCT ATC CTT GAC AAT GAC TTT ATG AAG AAC 1104 
Lys Asp Leu lie Lys Glu Ala lie Leu Asp Asn Asp Phe Met Lys Asn 
355 360 365 

TTG GAG CTG TCG CAG ATC CAG GAG ATT GTG GAT TGT ATG TAC CCG GTG 1152 
25 Leu Glu Leu Ser Gin lie Gin Glu lie Val Asp Cys Met Tyr Pro Val 
370 375 380 

GAG TAT GGC AAG GAC AGT TGC ATC ATC AAA GAA GGA GAC GTG GGG TCA 1200 
Glu Tyr Gly Lys Asp Ser Cys He He Lys Glu Gly Asp Val Gly Ser 
30 385 390 395 400 

CTG GTG TAT GTC ATG GAA GAT GGT AAG GTT GAA GTT ACA AAA GAA GGT 124 8 

Leu Val Tyr Val Met Glu Asp Gly Lys Val Glu Val Thr Lys Glu Gly 
405 410 415 

35 

GTG AAG TTG TGT ACC ATG GGT CCA GGA AAA GTG TTT GGG GAA TTG GCT 1296 

Val Lys Leu Cys Thr Met Gly Pro Gly Lys Val Phe Gly Glu Leu Ala 
420 425 430 

40 ATT CTT TAC AAC TGT ACC CGG ACA GCG ACC GTC AAG ACT CTT GTA AAT 1344 
He Leu Tyr Asn Cys Thr Arg Thr Ala Thr Val Lys Thr Leu Val Asn 
435 440 445 

GTA AAA CTC TGG GCC ATT GAT CGA CAA TGT TTT CAA ACA ATA ATG ATG 1392 
45 Val Lys Leu Trp Ala He Asp Arg Gin Cys Phe Gin Thr He Met Met 
450 455 460 

AGG ACA GGA CTC ATC AAG CAT ACC GAG TAT ATG GAA TTT TTA AAA AGC 144 0 

Arg Thr Gly Leu He Lys His Thr Glu Tyr Met Glu Phe Leu Lys Ser 
50 465 470 475 480 

GTT CCA ACA TTC CAG AGC CTT CCT GAA GAG ATC CTC AGC AAG CTT GCT 14 8 8 

Val Pro Thr Phe Gin Ser Leu Pro Glu Glu He Leu Ser Lys Leu Ala 
485 490 495 



55 



GAT GTC CTT GAA GAG ACC CAC TAT GAA AAT GGA GAA TAT ATT ATC AGG 153 6 
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Asp Val Leu Glu GIu Thr His Tyr Glu Asn Gly Glu Tyr lie He Arg 
500 505 

CAA GGT GCA AGA GGG GAC ACC TTC TTT ATC ATC AGC AAA GGA ACQ GTA 
Gin Gly Ala Arg Gly Asp Thr Phe Phe He He Ser Lys Gly Thr Val 
515 520 525 

AAT GTC ACT CGT GAA GAC TCA CCG AGT GAA GAC CCA QTC TTT CTT AGA 
Asn val Thr Arg Glu Asp Ser Pro Ser Glu Asp Pro Val Phe Leu Arg 
530 535 

ACT TTA GGA AAA GGA GAC TGG TTT GGA GAG AAA GCC TTG CAG GGG GAA 
Thr Leu Gly Lys Gly Asp Trp Phe Gly Glu Lys Ala Leu Gin Gly Glu 
550 555 

GAT GTG AGA ACA GCA AAC GTA ATT GCT GCA GAA GCT GTA ACC TGC CTT 
Asp val Arg Thr Ala Asn Val He Ala Ala Glu Ala Val Thr Cys Leu 
565 570 

GTG ATT GAC AGA GAC TCT TTT AAA CAT TTG ATT GGA GGG CTG GAT GAT 
Val He Asp Arg Asp Ser Phe Lys His Leu He Gly Gly Leu Asp Asp 
580 5B5 590 

GTT TCT AAT AAA GCA TAT GAA GAT GCA GAA GCT AAA GCA AAA TAT GAA 
Val Ser Asn Lys Ala Tyr Glu Asp Ala Glu Ala Lys Ala Lys Tyr Glu 
595 600 605 

GCT GAA GCG GCT TTC TTC GCC AAC CTG AAG CTG TCT GAT TTC AAC ATC 
Ala Glu Ala Ala Phe Phe Ala Asn Leu Lys Leu Ser Asp ^n H^ 

615 620 

tr °" ""^ GTA GAA CTG GTC CAG 

He Asp Thr Leu Gly Val Gly Gly Phe Gly Arg Val Glu Leu Val Gin 

"° 635 640 

TTG AAA AGT GAA GAA TCC AAA ACG TTT GCA ATG AAG ATT CTC AAG AAA 
Leu Lys ser Glu Glu Ser Lys Thr Phe Ala Met Lys He Leu Lys Lys 
"5 650 655 

SI t7 f,"^ T"" "^'^ ^^'^ TCA GAG AAG 

Arg Has He Val Asp Thr Arg Gin Gin Glu His He Arg Ser Glu Lys 

®" 665 670 



CAG ATC ATG CAG GGG GCT CAT TCC GAT TTC ATA GTG AGA CTG TAC AGA 
Gin He Met Gin Gly Ala His Ser Asp Phe He Val Arg Leu 5r Jrg 
"5 680 685 

ACA TTT AAG GAC AGC AAA TAT TTG TAT ATO TTG ATG GAA GCT TGT CTA 
Thr Phe Lys Asp Ser Lys Tyr Leu Tyr Met Leu Met Glu Ala Cys Leu 

695 700 

GGT GGA GAG CTC TGG ACC ATT CTC AGG GAT AGA GGT TCG TTT GAA GAT 
Gly Gly Glu Leu Trp Thr He Leu Arg Asp Arg Gly Ser Phe Isl 
" 715 720 

TCT ACA ACC AGA TTT TAC ACA GCA TGT GTG GTA GAA GCT TTT GCC TAT 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 

272 
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Ser Thr Thr Arg Phe Tyr Thr Ala Cys Val Val Glu Ala Phe Ala Tyr 
725 730 735 

CTG CAT TCC AAA GGA ATC ATT TAG AGG GAG CTC AAG CCA GAA AAT CTC 2256 
6 Leu His Ser Lys Gly lie lie Tyr Arg Asp Leu Lys Pro Glu Asn Leu 
740 745 750 

ATC CTA GAT CAC CGA GGT TAT GCC AAA CTG GTT GAT TTT GGC TTT GCA 2304 
lie Leu Asp His Arg Gly Tyr Ala Lys Leu Val Asp Phe Gly Phe Ala 
10 755 760 765 

AAG AAA ATA GGA TTT GGA AAG AAA ACA TGG ACT TTT TGT GGG ACT CCA 23 52 

Lys Lys He Gly Phe Gly Lys Lys Thr Trp Thr Phe Cys Gly Thr Pro 

770 775 780 

15 

GAG TAT GTA GCC CCA GAG ATC ATC CTG AAC AAA GGC CAT GAC ATT TCA 2400 

Glu Tyr Val Ala Pro Glu He He Leu Asn Lys Gly His Asp He Ser 
785 790 795 800 

20 GCC GAC TAG TGG TCA CTG GGA ATC CTA ATG TAT GAA CTC CTG ACT GGC 2448 

Ala Asp Tyr Trp Ser Leu Gly He Leu Met Tyr Glu Leu Leu Thr Gly 
805 810 815 

AGC CCA CCT TTC TCA GGC CCA GAT CCT ATG AAA ACC TAT AAC ATC ATA 2496 
25 Ser Pro Pro Phe Ser Gly Pro Asp Pro Met Lys Thr Tyr Asn He He 
820 825 830 

TTG AGG GGG ATT GAC ATG ATA GAA TTT CCA AAG AAG ATT GCC AAA AAT 2544 
Leu Arg Gly He Asp Met He Glu Phe Pro Lys Lys He Ala Lys Asn 
30 835 840 845 

GCT GCT AAT TTA ATT AAA AAA CTA TGC AGG GAC AAT CCA TCA GAA AGA 2592 
Ala Ala Asn Leu He Lys Lys Leu Cys Arg Asp Asn Pro Ser Glu Arg 
850 855 860 

35 

TTA GGG AAT TTG AAA AAT GGA GTA AAA GAC ATT CAA AAG CAC AAA TGG 2640 
Leu Gly Asn Leu Lys Asn Gly Val Lys Asp He Gin Lys His Lys Trp 
865 870 875 880 

40 TTT GAG GGC TTT AAC TGG GAA GGC TTA AGA AAA GGT ACC TTG ACA CCT 2688 

Phe Glu Gly Phe Asn Trp Glu Gly Leu Arg Lys Gly Thr Leu Thr Pro 
885 890 895 

CCT ATA ATA CCA AGT GTT GCA TCA CCC ACA GAC ACA AGT AAT TTT GAC 2736 
45 Pro He He Pro Ser Val Ala Ser Pro Thr Asp Thr Ser Asn Phe Asp 
900 905 910 

AGT TTC CCT GAG GAC AAC GAT GAA CCA CCA CCT GAT GAC AAC TCA GGA 2784 
Ser Phe Pro Glu Asp Asn Asp Glu Pro Pro Pro Asp Asp Asn Ser Gly 
50 915 920 925 

TGG GAT ATA GAC TTC TAA 2 802 

Trp Asp He Asp Phe 
930 

55 
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(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

^5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 60 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

. 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

=^00 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
, 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

170 3^75 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

1^5 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
230 235 240 

Gly Leu Arg Ser Arg Gly Ser Met Gly Thr Leu Arg Asp Leu Gin Tyr 

245 250 255 

Ala Leu Gin Glu Lys He Glu Glu Leu Arg Gin Arg Asp Ala Leu He 

265 270 
Asp Glu Leu Glu Leu Glu Leu Asp Gin Lys Asp Glu Leu He Gin Lys 

280 285 
Leu Gin Asn Glu Leu Asp Lys Tyr Arg Ser Val He Arg Pro Ala Thr 

290 295 300 

Gin Gin Ala Gin Lys Gin Ser Ala Ser Thr Leu Gin Gly Glu Pro Arg 

310 315 
Thr Lys Arg Gin Ala He Ser Ala Glu Pro Thr Ala Phe Asp He Gin 

325 330 
Asp Leu Ser His Val Thr Leu Pro Phe Tyr Pro Lys Ser Pro Gin Ser 
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340 345 350 





Lys 


Asp 


Leu 


He 


Lys 


Glu 


Ala 


He 


Leu 


Asp 


Asn 


Asp 


Phe 


Met 


Lys 


Asn 








355 










360 










365 










Leu 


Glu 


Leu 


Ser 


Gin 


He 


Gin 


Glu 


He 


Val 


Asp 


Cys 


Met 


Tyr 


Pro 


Val 


5 




370 










375 










380 












Glu 


Tyr 


Gly 


Lys 


Asp 


Ser 


Cys 


He 


He 


Lys 


Glu 


Gly Asp 


Val 


Gly 


Ser 




385 










390 










395 










400 




Leu 


Val 


Tyr 


Val 


Met 


Glu 


Asp 


Gly 


Lys 


Val 


Glu 


Val 


Thr 


Lys 


Glu 


Gly 












405 










410 










415 




10 


Val 


Lys 


Leu 


Cys 


Thr 


Met 


Gly 


Pro 


Gly 


Lys 


Val 


Phe 


Gly Glu 


Leu 


Ala 










420 










425 










430 








lie 


Leu 


Tyr 


Asn 


Cys 


Thr 


Arg 


Thr 


Ala 


Thr 


Val 


Lys 


Thr 


Leu 


Val 


Asn 








435 










440 










445 










Val 


Lys 


Leu 


Trp 


Ala 


He 


Asp 


Arg 


Gin 


Cys 


Phe 


Gin 


Thr 


He 


Met 


Met 


15 




450 










455 










460 












Arg 


Thr 


Gly 


Leu 


He 


Lys 


His 


Thr 


Glu 


Tyr 


Met 


Glu 


Phe 


Leu 


Lys 


Ser 




465 










470 










475 










480 




Val 


Pro 


Thr 


Phe 


Gin 


Ser 


Leu 


Pro 


Glu 


Glu 


He 


Leu 


Ser 


Lys 


Leu 


Ala 












485 










490 










495 




20 


Asp 


Val 


Leu 


Glu 


Glu 


Thr 


His 


Tyr 


Glu 


Asn 


Gly 


Glu 


Tyr 


He 


He 


Arg 










500 










505 










510 








Gin 


Gly 


Ala 


Arg 


Gly 


Asp 


Thr 


Phe 


Phe 


He 


He 


Ser 


Lys 


Gly 


Thr 


Val 








515 










520 










525 










Asn 


Val 


Thr 


Arg 


Glu 


Asp 


Ser 


Pro 


Ser 


Glu 


Asp 


Pro 


Val 


Phe 


Leu 


Arg 


25 




530 










535 










540 












Thr 


Leu 


Gly 


Lys 


Gly 


Asp 


Trp 


Phe 


Gly 


Glu 


Lys 


Ala 


Leu 


Gin 


Gly 


Glu 




545 










550 










555 










560 




Asp 


Val 


Arg 


Thr 


Ala 


Asn 


Val 


He 


Ala 


Ala 


Glu 


Ala 


Val 


Thr 


Cys 


Leu 












565 










570 










575 




30 


Val 


lie 


Asp 


Arg 


Asp 


Ser 


Phe 


Lys 


His 


Leu 


He 


Gly 


Gly 


Leu 


Asp 


Asp 










590 










585 










590 








Val 


Ser 


Asn 


Lys 


Ala 


Tyr 


Glu 


Asp 


Ala 


Glu 


Ala 


Lys 


Ala 


Lys 


Tyr 


Glu 








595 










600 










605 










Ala 


Glu 


Ala 


Ala 


Phe 


Phe 


Ala 


Asn 


Leu 


Lys 


Leu 


Ser Asp 


Phe 


Asn 


He 


35 




610 










615 










620 












lie 


Asp 


Thr 


Leu 


Gly 


Val 


Gly 


Gly 


Phe 


Gly 


Arg 


Val 


Glu 


Leu 


Val 


Gin 




625 










630 










635 










640 




Leu 


Lys 


Ser 


Glu 


Glu 


Ser 


Lys 


Thr 


Phe 


Ala 


Met 


Lys 


He 


Leu 


Lys 


Lys 












645 










650 










655 




40 


Arg 


His 


lie 


Val 


Asp 


Thr 


Arg 


Gin 


Gin 


Glu 


His 


He 


Arg 


Ser 


Glu 


Lys 










660 










665 










670 








Gin 


lie 


Met 


Gin 


Gly 


Ala 


His 


Ser 


Asp 


Phe 


He 


Val 


Arg 


Leu 


Tyr 


Arg 








675 










680 










685 










Thr 


Phe 


Lys 


Asp 


Ser 


Lys 


Tyr 


Leu 


Tyr 


Met 


Leu 


Met 


Glu 


Ala 


Cys 


Leu 


45 




690 










695 










700 












Gly 


Gly 


Glu 


Leu 


Trp 


Thr 


He 


Leu 


Arg 


Asp 


Arg 


Gly 


Ser 


Phe 


Glu 


Asp 




705 










710 










715 










720 




Ser 


Thr 


Thr 


Arg 


Phe 


Tyr 


Thr 


Ala 


Cys 


Val 


Val 


Glu 


Ala 


Phe 


Ala 


Tyr 












725 










730 










735 




50 


Leu 


His 


Ser 


Lys 


Gly 


He 


He 


Tyr 


Arg 


Asp 


Leu 


Lys 


Pro 


Glu 


Asn 


Leu 










740 










745 










750 








lie 


Leu 


Asp 


His 


Arg 


Gly 


Tyr 


Ala 


Lys 


Leu 


Val 


Asp 


Phe 


Gly 


Phe 


Ala 








755 










760 










765 










Lys 


Lys 


He 


Gly 


Phe 


Gly 


Lys 


Lys 


Thr 


Trp 


Thr 


Phe 


Cys 


Gly 


Thr 


Pro 


55 




770 










775 










780 












Glu 


Tyr 


Val 


Ala 


Pro 


Glu 


He 


He 


Leu 


Asn 


Lys 


Gly 


His 


Asp 


He 


Ser 
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I" "0 795 800 

Ala Asp Tyr Trp Ser Leu Gly He Leu Met Tyr Glu Leu Leu Thr Gly 

805 810 815 

Ser Pro Pro Phe Ser Gly Pro Asp Pro Met Lys Thr Tyr Asa He He 

820 825 830 

Leu Arg Gly He Asp Met He Glu Phe Pro Lys Lys He Ala Lys Asn 

835 840 845 

Ala Ala Asn Leu He Lys Lys Leu Cys Arg Asp Asn Pro Ser Glu Arq 

850 855 860 

Leu Gly Asn Leu Lys Asn Gly Val Lys Asp He Gin Lys His Lys Trp 
, ^■'O 875 880 

Phe Glu Gly Phe Asn Trp Glu Gly Leu Arg Lys Gly Thr Leu Thr Pro 

885 890 895 

Pro He He Pro Ser Val Ala Ser Pro Thr Asp Thr Ser Asn Phe Asp 

500 905 910 

Ser Phe Pro Glu Asp Asn Asp Glu Pro Pro Pro Asp Asp Asn Ser Gly 

915 920 925 

Trp Asp He Asp Phe 
930 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2795 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

ATG GGC ACC TTG CGG GAT TTA CAG TAG GCG CTC GAG GAG AAG ATC GAG 48 
Met Gly Thr Leu Arg Asp Leu Gin Tyr Ala Leu Gin Glu Lys He Glu 
1 5 10 



15 



GAG CTG AGO CAG CGG GAT GCT CTC ATC GAC GAG CTG GAG CTG GAG TTG 96 
Glu Leu Arg Gin Arg Asp Ala Leu He Asp Glu Leu Glu Leu Glu Leu 
20 25 30 



GAT CAG AAG GAC GAA CTG ATC CAG AAG CTG CAG AAC GAG CTG GAC AAG 

Asp Gin Lys Asp Glu Leu He Gin Lys Leu Gin Asn Glu Leu Asp Lys 
35 40 45 

TAC CGC TOG GTG ATC CGA CCA GCC ACC CAG CAG GCG CAG AAG CAG AGC 
Tyr Arg Ser Val He Arg Pro Ala Thr Gin Gin Ala Gin Lys Gin Ser 
5° 55 60 

GCG AGC ACC TTG CAG GGC GAG CCG CGC ACC AAG CGG CAG GCG ATC TCC 
Ala Ser Thr Leu Gin Gly Glu Pro Arg Thr Lys Arg Gin Ala He Ser 
" 70 75 80 



144 



192 



240 
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GCC GAG CCC ACC GCC TTC GAC ATC CAG GAT CTC AGC CAT GTG ACC CTG 288 

Ala Glu Pro Thr Ala Phe Asp lie Gin Asp Leu Ser His Val Thr Leu 

85 90 95 

5 

CCC TTC TAC CCC AAG AGC CCA CAG TCC AAG GAT CTT ATA AAG GAA GCT 336 

Pro Phe Tyr Pro Lys Ser Pro Gin Ser Lys Asp Leu lie Lys Glu Ala 
100 105 110 

10 ATC CTT GAC AAT GAC TTT ATG AAG AAC TTG GAG CTG TCG CAG ATC CAG 384 
lie Leu Asp Asn Asp Phe Met Lys Asn Leu Glu Leu Ser Gin lie Gin 
115 120 125 

GAG ATT GTG GAT TGT ATG TAC CCG GTG GAG TAT GGC AAG GAC AGT TGC 432 
15 Glu lie Val Asp Cys Met Tyr Pro Val Glu Tyr Gly Lys Asp Ser Cys 
130 135 140 

ATC ATC AAA GAA GGA GAC GTG GGG TCA CTG GTG TAT GTC ATG GAA GAT 480 
He He Lys Glu Gly Asp Val Gly Ser Leu Val Tyr Val Met Glu Asp 
20 145 150 155 160 

GGT AAG GTT GAA GTT ACA AAA GAA GGT GTG AAG TTG TGT ACC ATG GGT 528 

Gly Lys Val Glu Val Thr Lys Glu Gly Val Lys Leu Cys Thr Met Gly 
165 170 175 

25 

CCA GGA AAA GTG TTT GGG GAA TTG GCT ATT CTT TAC AAC TGT ACC CGG 576 

Pro Gly Lys Val Phe Gly Glu Leu Ala He Leu Tyr Asn Cys Thr Arg 
180 185 190 

30 ACA GCG ACC GTC AAG ACT CTT GTA AAT GTA AAA CTC TGG GCC ATT GAT 624 
Thr Ala Thr Val Lys Thr Leu Val Asn Val Lys Leu Trp Ala He Asp 
195 200 205 

CGA CAA TGT TTT CAA ACA ATA ATG ATG AGG ACA GGA CTC ATC AAG CAT 672 
35 Arg Gin Cys Phe Gin Thr He Met Met Arg Thr Gly Leu He Lys His 
210 215 220 

ACC GAG TAT ATG GAA TTT TTA AAA AGC GTT CCA ACA TTC CAG AGC CTT 720 
Thr Glu Tyr Met Glu Phe Leu Lys Ser Val Pro Thr Phe Gin Ser Leu 
40 225 230 235 240 

CCT GAA GAG ATC CTC AGC AAG CTT GCT GAT GTC CTT GAA GAG ACC CAC 768 
Pro Glu Glu He Leu Ser Lys Leu Ala Asp Val Leu Glu Glu Thr His 
245 250 255 

45 

TAT GAA AAT GGA GAA TAT ATT ATC AGG CAA GGT GCA AGA GGG GAC ACC 816 
Tyr Glu Asn Gly Glu Tyr He He Arg Gin Gly Ala Arg Gly Asp Thr 
260 265 270 

50 TTC TTT ATC ATC AGC AAA GGA ACG GTA AAT GTC ACT CGT GAA GAC TCA 864 

Phe Phe He He Ser Lys Gly Thr Val Asn Val Thr Arg Glu Asp Ser 
275 280 285 

CCG AGT GAA GAC CCA GTC TTT CTT AGA ACT TTA GGA AAA GGA GAC TGG 912 
55 Pro Ser Glu Asp Pro Val Phe Leu Arg Thr Leu Gly Lys Gly Asp Trp 
290 295 300 
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TTT GGA GAG AAA GCC TTG CAG GGG GAA GAT GTG AGA ACA GCA AAC GTA 960 
Phe Gly Glu Lys Ala Leu Gin Gly Glu Asp Val Arg Thr Ala Asn Val 
31° 315 320 

ATT GCT GCA GAA GOT GTA ACC TGC CTT GTG ATT GAC AGA GAC TCT TTT 1008 
He Ala Ala Glu Ala Val Thr Cys Leu Val He Asp Arg Asp Ser Phe 
325 330 

AAA CAT TTG ATT GGA GGG CTG GAT GAT GTT TCT AAT AAA GCA TAT GAA 
Lys His Leu He Gly Qly Leu Asp Asp Val Ser Asn Lys Ala Tyr Glu 

345 350 

GAT GCA GAA GCT AAA GCA AAA TAT GAA GCT GAA GCG GCT TTC TTC GCC 
Asp Ala Glu Ala Lys Ala Lys Tyr Glu Ala Glu Ala Ala Phe Phe Ala 

360 365 

AAC CTG AAG CTG TCT GAT TTC AAC ATC ATT GAT ACC CTT GGA GTT GGA 
Asn Leu Lys Leu Ser Asp .Phe Asn He lie Asp Thr Leu Gly Val Gly 
"0 375 380 

GGT TTC GGA CGA GTA GAA CTG GTC CAG TTG AAA AGT GAA GAA TCC AAA 
Gly Phe Gly Arg Val Glu Leu Val Gin Leu Lys Ser Glu Glu Ser Lvs 

390 395 400 

ACG TTT GCA ATG AAG ATT CTC AAG AAA CGT CAC ATT GTG GAC ACA AGA 
Thr Phe Ala Met Lys He Leu Lys Lys Arg His He Val Asp Thr Arg 
*05 

CAG CAG GAG CAC ATC CGC TCA GAG AAG CAG ATC ATG CAG GGG GCT CAT 
Gin Gin Glu His He Arg Ser Glu Lys Gin He Met Gin Gly Ala His 
*20 425 430 

TCC GAT TTC ATA GTG AGA CTG TAC AGA ACA TTT AAG GAC AGC AAA TAT 
ser Asp Phe He Val Arg Leu Tyr Arg Thr Phe Lys Asp Ser Lys Tyr 
435 440 



TTG TAT ATG TTG ATG GAA GCT TGT CTA GGT GGA GAG CTC TGG ACC ATT 
Leu Tyr Met Leu Met Glu Ala Cys Leu Gly Gly Glu Leu Trp Thr He 

455 460 



CTC AGG GAT AGA GGT TCG TTT GAA GAT TCT ACA ACC AGA TTT TAC ACA 
Leu Arg Asp Arg Gly Ser Phe Glu Asp Ser Thr Thr Arg Phe Tyr Thr 
^" 470 475 480 

GCA TGT GTG GTA GAA GCT TTT GCC TAT CTG CAT TCC AAA GGA ATC ATT 
Ala Cys Val Val Glu Ala Phe Ala Tyr Leu His Ser Lys Gly He He 
485 490 495 

TAC AGG GAC CTC AAG CCA GAA AAT CTC ATC CTA GAT CAC CGA GGT TAT 
Tyr Arg Asp Leu Lys Pro Glu Asn Leu He Leu Asp His Arg Gly Tyr 
500 505 



GCC AAA CTG GTT GAT TTT GGC TTT GCA AAG AAA ATA GGA TTT GGA AAG 
Ala Lys Leu Val Asp Phe Gly Phe Ala Lys Lys He Gly Phe Gly Lys 

520 525 



1056 



1104 



1152 



1200 



124 8 



1296 



1344 



1392 



1440 



1488 



1536 



1584 
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AAA ACA TGG ACT TTT TGT GGG ACT CCA GAG TAT GTA GCC CCA GAG ATC 1632 

Lys Thr Trp Thr Phe Cys Gly Thr Pro Glu Tyr Val Ala Pro Glu lie 
530 535 540 

5 

ATC CTG AAC AAA GGC CAT GAC ATT TCA GCC GAC TAC TGG TCA CTG GGA 1680 

lie Leu Asn Lys Gly His Asp lie Ser Ala Asp Tyr Trp Ser Leu Gly 
545 550 555 560 

10 ATC CTA ATG TAT GAA CTC CTG ACT GGC AGC CCA CCT TTC TCA GGC CCA 1728 
He Leu Met Tyr Glu Leu Leu Thr Gly Ser Pro Pro Phe Ser Gly Pro 
565 570 575 

GAT CCT ATG AAA ACC TAT AAC ATC ATA TTG AGG GGG ATT GAC ATG ATA 1776 
15 Asp Pro Met Lys Thr Tyr Asn He He Leu Arg Gly He Asp Met He 
580 585 590 

GAA TTT CCA AAG AAG ATT GCC AAA AAT GCT GCT AAT TTA ATT AAA AAA 1824 
Glu Phe Pro Lys Lys He Ala Lys Asn Ala Ala Asn Leu He Lys Lys 
20 595 600 605 

CTA TGC AGG GAC AAT CCA TCA GAA AGA TTA GGG AAT TTG AAA AAT GGA 1872 

Leu Cys Arg Asp Asn Pro Ser Glu Arg Leu Gly Asn Leu Lys Asn Gly 
610 615 620 

25 

GTA AAA GAC ATT CAA AAG CAC AAA TGG TTT GAG GGC TTT AAC TGG GAA 1920 

Val Lys Asp He Gin Lys His Lys Trp Phe Glu Gly -Phe Asn Trp Glu 
625 630 635 640 

30 GGC TTA AGA AAA GGT ACC TTG ACA CCT CCT ATA ATA CCA AGT GTT GCA 1968 
Gly Leu Arg Lys Gly Thr Leu Thr Pro Pro He He Pro Ser Val Ala 
645 650 655 

TCA CCC ACA GAC ACA AGT AAT TTT GAC AGT TTC CCT GAG GAC AAC GAT 2016 
35 Ser Pro Thr Asp Thr Ser Asn Phe Asp Ser Phe Pro Glu Asp Asn Asp 
660 665 670 

GAA CCA CCA CCT GAT GAC AAC TCA GGA TGG GAT ATA GAC TTC TCG GAT 2064 
Glu Pro Pro Pro Asp Asp Asn Ser Gly Trp Asp He Asp Phe Ser Asp 
40 675 680 685 

CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 2112 
Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
690 695 700 

45 

GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 2160 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
705 710 715 720 

50 TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG 220 8 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
725 730 735 

ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 2256 
55 Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
740 745 750 
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ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAG 
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 
755 760 

CCC GAG CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 
Pro Asp Has Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
"° 775 780 

GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
■'^O 795 800 



(2) INFORMATION FOR SEQ ID NO: 13 7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



2304 



2352 



2400 



1^ ^ """^ '^'^^ ACC CTG GTG AAC CGC 244 8 

15 Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

810 

ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 2496 
He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
820 825 830 

CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 
Has Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
835 840 845 



2544 



2592 



2640 



GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 
Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 

855 860 

ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 
He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
°" 870 875 880 

35 Pro H» f '''''' """^ "^^^ ^''^ -^AC CTG AGC 2688 

35 Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

890 895 

?S 111 sfr r° ^"^"^ '^^^ CAC ATG 2736 

Thr Gin ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

905 

GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 2784 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
"5 920 925 

GAG CTG TAC AA GTAA 

Glu Leu Tyr Lys 2799 
930 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 





Met 


Gly 


Thr 


Leu 


Arg 


Asp 


Leu 


Gin 


Tyr 


Ala 


Leu 


Gin 


Glu 


Lys 


He 


Glu 




1 








5 










10 










15 






Glu 


Leu 


Arg 


Gin 


Arg 


Asp 


Ala 


Leu 


He 


Asp 


Glu 


Leu 


Glu 


Leu 


Glu 


Leu 










20 










25 










30 








Asp 


Gin 


Lys 


Asp 


Glu 


Leu 


He 


Gin 


Lys 


Leu 


Gin 


Asn 


Glu 


Leu 


Asp 


Lys 








35 










40 










45 










Tyr 


Arg 


Ser 


Val 


He 


Arg 


Pro 


Ala 


Thr 


Gin 


Gin 


Ala 


Gin 


Lys 


Gin 


Ser 


15 




50 










55 










60 












Ala 


Ser 


Thr 


Leu 


Gin 


Gly 


Glu 


Pro 


Arg 


Thr 


Lys 


Arg 


Gin 


Ala 


He 


Ser 




65 










70 










75 










80 




Ala 


Glu 


Pro 


Thr 


Ala 


Phe 


Asp 


He 


Gin 


Asp 


Leu 


Ser 


His 


Val 


Thr 


Leu 












85 










90 










95 




20 


Pro 


Phe 


Tyr 


Pro 


Lys 


Ser 


Pro 


Gin 


Ser 


Lys 


Asp 


Leu 


He 


Lys 


Glu 


Ala 










100 










105 










110 








He 


Leu 


Asp 


Asn 


Asp 


Phe 


Met 


Lys 


Asn 


Leu 


Glu 


Leu 


Ser 


Gin 


He 


Gin 








115 










120 










125 










Glu 


He 


Val 


Asp 


Cys 


Met 


Tyr 


Pro 


Val 


Glu 


Tyr 


Gly 


Lys 


Asp 


Ser 


Cys 


25 




130 










135 










140 












He 


He 


Lys 


Glu 


Gly 


Asp 


Val 


Gly 


Ser 


Leu 


Val 


Tyr 


Val 


Met 


Glu 


Asp 




145 










150 










155 










160 




Gly 


Lys 


Val 


Glu 


Val 


Thr 


Lys 


Glu 


Gly 


Val 


Lys 


Leu 


Cys 


Thr 


Met 


Gly 












165 










170 










175 




30 


Pro 


Gly 


Lys 


Val 


Phe 


Gly 


Glu 


Leu 


Ala 


He 


Leu 


Tyr 


Asn 


Cys 


Thr 


Arg 










180 










185 










190 








Thr 


Ala 


Thr 


Val 


Lys 


Thr 


Leu 


Val 


Asn 


Val 


Lys 


Leu 


Trp 


Ala 


He 


Asp 








195 










200 










205 










Arg 


Gin 


Cys 


Phe 


Gin 


Thr 


He 


Met 


Met 


Arg 


Thr 


Gly 


Leu 


He 


Lys 


His 


35 




210 










215 










220 












Thr 


Glu 


Tyr 


Met 


Glu 


Phe 


Leu 


Lys 


Ser 


Val 


Pro 


Thr 


Phe 


Gin 


Ser 


Leu 




225 










230 










235 










240 




Pro 


Glu 


Glu 


He 


Leu 


Ser 


Lys 


Leu 


Ala 


Asp 


Val 


Leu 


Glu 


Glu 


Thr 


His 












245 










250 










255 




40 


Tyr 


Glu 


Asn 


Gly 


Glu 


Tyr 


He 


He 


Arg 


Gin 


Gly 


Ala Arg Gly 


Asp 


Thr 










260 










265 










270 








Phe 


Phe 


He 


He 


Ser 


Lys 


Gly 


Thr 


val 


Asn 


val 


Thr 


Arg 


Glu 


Asp 


Ser 








275 










280 










285 










Pro 


Ser 


Glu 


Asp 


Pro 


Val 


Phe 


Leu 


Arg 


Thr 


Leu 


Gly 


Lys 


Gly 


Asp 


Trp 


45 




290 










295 










300 












Phe 


Gly 


Glu 


Lys 


Ala 


Leu 


Gin 


Gly 


Glu 


Asp 


Val 


Arg 


Thr 


Ala 


Asn 


Val 




305 










310 










315 










320 




He 


Ala 


Ala 


Glu 


Ala 


Val 


Thr 


Cys 


Leu 


Val 


He 


Asp 


Arg 


Asp 


Ser 


Phe 












325 










330 










335 




50 


Lys 


His 


Leu 


He 


Gly 


Gly 


Leu 


Asp 


Asp 


Val 


Ser 


Asn 


Lys 


Ala 


Tyr 


Glu 










340 










345 










350 








Asp 


Ala 


Glu 


Ala 


Lys 


Ala 


Lys 


Tyr 


Glu 


Ala 


Glu 


Ala 


Ala 


Phe 


Phe 


Ala 








355 










360 










365 










Asn 


Leu 


Lys 


Leu 


Ser 


Asp 


Phe 


Asn 


He 


He 


Asp 


Thr 


Leu 


Gly 


Val 


Gly 


55 




370 










375 










380 












Gly 


Phe 


Gly 


Arg 


Val 


Glu 


Leu 


Val 


Gin 


Leu 


Lys 


Ser 


Glu 


Glu 


Ser 


Lys 
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390 

Thr Phe Ala Met Lys He Leu Lys Lys Arg His He Val Asp Thr Arg 
405 

Gin Gin Glu His He Arg Ser Glu Lys Gin He Met Gin Gly Ala His 

425 

Ser Asp Phe He Val Arg Leu Tyr Arg Thr Phe Lys Asp Ser Lys Tyr 

435 440 
Leu Tyr Met Leu Met Glu Ala Cys Leu Gly Gly Glu Leu Trp Thr He 
r 460 

Leu Arg Asp Arg Gly Ser Phe Glu Asp Ser Thr Thr Arg Phe Tyr Thr 

470 475 
Ala cys Val Val Glu Ala Phe Ala Tyr Leu His Ser Lys Gly He He 

435 490 
Tyr Arg Asp Leu Lys Pro Glu Asn Leu He Leu Asp His Arg Gly Tyr 

500 gj^Q 
Ala Lys Leu Val Asp Phe Gly Phe Ala Lys Lys He Gly Phe Gly Lys 

515 520 525 

Lys Thr Trp Thr Phe Cys Gly Thr Pro Glu Tyr Val Ala Pro Glu He 
Tl r . 540 

He Leu Asn Lys Gly His Asp He Ser Ala Asp Tyr Trp Ser Leu Gly 
It "° 555 560 

He Leu Met Tyr Glu Leu Leu Thr Gly Ser Pro Pro Phe Ser Gly Pro 

565 570 575 

Asp Pro Met Lys Thr Tyr Asn He He Leu Arg Gly He Asp Met He 

580 585 590 

Glu Phe Pro Lys Lys He Ala Lys Asn Ala Ala Asn Leu He Lys Lys 

595 soo 605 

Leu Cys Arg Asp Asn Pro Ser Glu Arg Leu Gly Asn Leu Lys Asn Gly 



20 



30 



35 



40 



45 



50 



"° 615 620 



Val Lys Asp He Gin Lys His Lys Trp Phe Glu Gly Phe Asn Trp Glu 

"° 635 640 

Gly Leu Arg Lys Gly Thr Leu Thr Pro Pro He He Pro Ser Val Ala 

645 650 655 

Ser Pro Thr Asp Thr Ser Asn Phe Asp Ser Phe Pro Glu Asp Asn Asp 

660 665 670 

Glu Pro Pro Pro Asp Asp Asn Ser Gly Trp Asp He Asp Phe Ser Asp 

675 680 685 

Pro Pro val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 

695 700 
val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

715 720 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 

"^25 730 735 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 

■^•40 745 750 

Thr Leu val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

'55 760 765 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

775 780 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 

790 795 nnn 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
805 810 815 

55 Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

82 0 825 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
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835 840 B45 

Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 

850 855 860 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
5 865 870 875 880 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

885 890 895 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
900 905 910 

10 Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
915 920 925 

Glu Leu Tyr Lys 
930 

15 (2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2184 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: CDNA 
<ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2181 
(D) OTHER INFORMATION: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 .15 

35 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

40 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
45 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
^ Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

50 65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



55 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
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Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 



110 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTX3 AAG GGC 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTO GGG CAC AAG CTG GAG TAC 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
"° 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin lJs 

"° 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 
val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 
Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
^" 215 220 

Tl tII Alf t^"" T """^ "^"^ '^'^ *AG TCC 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

"° 235 240 

GGA CTC AGA TCT CGA GGC ACC ATG AGC GAC GTG GCT ATT GTG AAG GAG 
Gly Leu Arg Ser Arg Gly Thr Met Ser Asp Val Ala He Val Lys Glu 
245 250 255 

GGT TGG CTG CAC AAA CGA GGG GAG TAC ATC AAG ACC TGG CGG CCA CGC 
Gly Trp Leu His Lys Arg Gly Glu Tyr He Lys Thr Trp Arg Pro Arg 
260 265 270 

TAC TTC CTC CTC AAG AAT GAT GGC ACC TTC ATT GGC TAC AAG GAG CGG 
Tyr Phe Leu Leu Lys Asn Asp Gly Thr Phe He Gly Tyr Lys Glu Arg 
275 280 285 

Pro f """^ ^''^ '^CC A*C AAC TTC TCT GTG 

Pro Gin Asp val Asp Gin Arg Glu Ala Pro Leu Asn Asn Phe Ser Val 

295 300 

Sa Sn of 7° t''^ C^ C<^C ^C TTC 

Ala Gin Cys Gin Leu Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe 

315 320 
ATC ATC CGC TGC CTG CAG TGG ACC ACT GTC ATC GAA CGC ACC TTC CAT 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 
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lie lie Arg Cys 



Leu Gin Trp 
325 



285 

Thr Thr Val 
330 



lie Glu Arg Thr 



Phe His 
335 



GTG GAG ACT CCT GAG GAG CGG GAG GAG TGG ACA ACC GCC ATC CAG ACT 
Val Glu Thr Pro Glu Glu Arg Glu Glu Trp Thr Thr Ala lie Gin Thr 
340 345 350 



1056 



10 



GTG GCT GAC GGC CTC AAG AAG CAG GAG GAG GAG GAG ATG GAC TTC CGG 
Val Ala Asp Gly Leu Lys Lys Gin Glu Glu Glu Glu Met Asp Phe Arg 
355 360 365 



1104 



15 



TCG 
Ser 



CTG 
Leu 
385 



GGC 
Gly 
370 

GCC 
Ala 



TCA CCC 
Ser Pro 



AAG CCC 
Lys Pro 



AGT GAC AAC 
Ser Asp Asn 
375 

AAG CAC CGC 
Lys His Arg 
390 



TCA GGG GCT 
Ser Gly Ala 



GTG ACC ATG 
Val Thr Met 



GAA GAG ATG GAG 
Glu Glu Met Glu 
380 

AAC GAG TTT GAG 
Asn Glu Phe Glu 
395 



GTG TCC 
Val Ser 



TAC CTG 
Tyr Leu 
400 



1152 



1200 



20 



AAG 
Lys 



CTG 
Leu 



CTG GGC 
Leu Gly 



AAG GGC ACT 
Lys Gly Thr 
405 



TTC GGC AAG 
Phe Gly Lys 
410 



GTG ATC CTG GTG 
Val He Leu Val 



AAG GAG 
Lys Glu 
415 



1248 



25 



AAG GCC 
Lys Ala 



ACA GGC 
Thr Gly 
420 



CGC TAC TAC 
Arg Tyr Tyr 



GCC ATG AAG 
Ala Met Lys 
425 



ATC CTC 
lie Leu 



AAG AAG 
Lys Lys 
430 



GAA GTC 
Glu Val 



1296 



30 



ATC GTG 
He Val 



GCC AAG 
Ala Lys 
435 



GAC GAG GTG 
Asp Glu Val 



GCC CAC ACA 
Ala His Thr 
440 



CTC ACC 
Leu Thr 



GAG AAC 
Glu Asn 
445 



CGC GTC 
Arg Val 



1344 



35 



CTG CAG 
Leu Gin 
450 

CAG ACC 
Gin Thr 
465 



AAC TCC 
Asn Ser 



CAC GAC 
His Asp 



AGG CAC CCC 
Arg His Pro 
455 

CGC CTC TGC 
Arg Leu Cys 
470 



TTC CTC ACA 
Phe Leu Thr 



TTT GTC ATG 
Phe Val Met 



GCC CTG 
Ala Leu 
460 

GAG TAC 
Glu Tyr 
475 



AAG TAC 
Lys Tyr 



GCC AAC 
Ala Asn 



TCT TTC 
Ser Phe 



GGG GGC 
Gly Gly 
480 



1392 



1440 



40 



GAG CTG 
Glu Leu 



TTC TTC 
Phe Phe 



CAC CTG TCC 
His Leu Ser 
485 



CGG GAA CGT 
Arg Glu Arg 
490 



GTG TTC 
Val Phe 



TCC GAG 
Ser Glu 



GAC CGG 
Asp Arg 
495 



1488 



45 



GCC CGC 
Ala Arg 



TTC TAT 
Phe Tyr 
500 



GGC GCT GAG 
Gly Ala Glu 



ATT GTG TCA 
He Val Ser 
505 



GCC CTG 
Ala Leu 



GAC TAC 
Asp Tyr 
510 



CTG CAC 
Leu His 



1536 



50 



TCG GAG 
Ser Glu 



AAG AAC 
Lys Asn 
515 



GTG GTG TAC 
Val Val Tyr 



CGG GAC CTC 
Arg Asp Leu 
520 



AAG CTG 
Lys Leu 



GAG AAC 
Glu Asn 
525 



CTC ATG 
Leu Met 



1584 



55 



CTG GAC 
Leu Asp 
530 



AAG GAC 
Lys Asp 



GGG CAC ATT 
Gly His He 
535 



AAG ATC ACA 
Lys He Thr 



GAC TTC 
Asp Phe 
540 



GGG CTG 
Gly Leu 



TGC AAG 
Cys Lys 



GAG GGG ATC AAG GAC GGT GCC ACC ATG AAG ACC TTT TGC GGC ACA CCT 



1632 



1680 
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Glu Gly He Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro 

550 555 

GAG TAG CTG GCC CCC GAG GTG CTG GAG GAG AAT GAG TAG GGC CGT GCA 1728 
5 Glu Tyr Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala 

565 570 575 

GTG GAG TGG TGG GGG CTG GGC GTG GTC ATG TAG GAG AIG ATG TGC GGT 1776 
Val Asp Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly 
10 580 585 590 



CGC CTG CCC TTC TAG AAC CAG GAC CAT GAG AAG CTT TTT GAG CTG ATG 
Arg Leu Pro Phe Tyr Asn Gin Asp His Glu Lys Leu Phe Glu Leu He 
595 600 605 

CTG ATG GAG GAG ATG CGC TTC CCG CGC ACG CTT GGT CCC GAG GCC AAG 
Leu Met Glu Glu He Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys 

6X5 620 

TCC TTG CTT TCA GGG CTG CTC AAG AAG GAC CCC AAG CAG AGG CTT GGC 
Ser Leu Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gin Arg Leu Gly 
"5 630 635 640 

GGG GGC TCC GAG GAC GCC AAG GAG ATC ATG CAG CAT CGC TTC TTT GCC 1968 
Gly Gly Ser Glu Asp Ala Lys Glu He Met Gin His Arg Phe Phe Ala 
^45 650 655 

GGT ATC GTG TGG CAG CAC GTG TAG GAG AAG AAG CTC AGC CCA CCC TTC 2016 
Gly He Val Trp Gin His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe 
30 660 



15 



20 



25 



35 



40 



45 



665 670 

AAG CCC CAG GTC ACG TGG GAG ACT GAC ACC AGG TAT TTT GAT GAG GAG 
Lys Pro Gin Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu 

680 685 

TTC ACG GCC CAG ATG ATC ACC ATC ACA CCA CCT GAC CAA GAT GAC AGC 
Phe Thr Ala Gin Met He Thr He Thr Pro Pro Asp Gin Asp Asp Ser 

695 700 



TAC TCG GCC AGC AGC ACG GCC TGA 
Tyr Ser Ala Ser Ser Thr Ala 
725 



1824 



1872 



1920 



2064 



2112 



ATG GAG TGT GTG GAC AGC GAG CGC AGG CCC CAC TTC CCC CAG TTC TCC 2160 
Met Glu Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gin Phe Ser 
■'^S 710 715 720 



2184 



(2) INFORMATION FOR SEQ ID NO -139 • 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
55 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

5 





Met 


Val 


Ser 


Lys 




Glu 


Glu 


Leu 


JCrXlC 


Thr 


Gly 


VclX 


VaX 


Pro 


He 


Leu 




1 

X 


















10 










X3 






Val 


Glu 


Leu 
eu 


Asp 


Gly 


sp 


Val 


Asn 


vaxy 


His 


Lys 




Ser 


VelX 


Ser 


uxy 




















25 










3 0 






10 




m \r 


Glu 


Gly 


sp 


/^Xcl 


X llX 


Tyr 


VjXy 


Lys 


Leu 


Thr 


T All 

j^eu 


Lys 


fne 


xxe 








•a c 










40 










45 










Cys 


X llx 


XIJ.X 


vjxy 


Lys 


Leu 


Pro 


vax 


Pro 


Trp 


Pro 


X nr 


Leu 


vax 


xxir 


xnr 






5 0 










55 










60 












Leu 


Thr 


Tyr 


ij j.y 


vax 


Lixn 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


Hx S 


Met 


Lys 


1 o 


65 










70 










75 










80 






His 


Asp 


fne 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


vax 


CjXn 


Glu 












85 










90 










95 






Arg 


Thr 


lie 


Pne 


pne 


Lys 


Asp 


Asp 


Gly 


Asn Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 










100 










105 










110 






on 


vax 


Lys 


pne 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn Arg 


He 


Glu 


Leu 


Lys 


Gly 








115 










120 










125 










lie 


Asp 


Pne 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu Gly 


His 


Lys 


Leu 


Glu 


Tyr 






130 










135 










140 












Asn 


Tyr 


Asn 


Ser 


xlxS 


Asn 


va± 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


(jXn 


Lys 


Asn 


OR 


145 










150 










155 










160 




Gly 


lie 


Lys 


Val 


Asn 


T->1_ ^ 

pne 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 












165 










170 










175 






vaj. 




Leu 


Ax a 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 










180 










185 










190 








Pro 


vax 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 








195 










200 










205 










Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


pne 






210 










215 










220 












va± 


1 nr 


j\± a 


>\x a 


Gly 


X ±e 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 


oo 


22 5 










230 










235 










24 0 




Gly 


Leu 


Arg 


Ser 


Arg 


Gly 


Thr 


Met 


Ser 


Asp 


Val 


Ala 


He 


Val 


Lys 


Glu 












245 










250 










255 






Gly 


Trp 


Leu 


His 


Lys 


Arg 


Gly 


Glu 


Tyr 


He 


Lys 


Thr 


Trp 


Arg 


Pro 


Arg 










260 










265 










270 








Tyr 


Fne 


Leu 


Leu 


Lys 


Asn 


Asp 


Gly 


Thr 


Phe 


He 


Gly 


Tyr 


Lys 


Glu 


Arg 








275 










280 










285 










Pro 


Gin 


Asp 


Val 


Asp 


Gin 


Arg 


Glu 


Ala 


Pro 


Leu 


Asn 


Asn 


Phe 


Ser 


Val 






290 










295 










300 












Ala 


Gin 


Cys 


Gin 


Leu 


Met 


Lys 


Thr 


Glu 


Arg 


Pro 


Arg 


Pro 


Asn 


Thr 


Phe 


45 


305 










310 










315 










320 




He 


He 


Arg 


Cys 


Leu 


Gin 


Trp 


Thr 


Thr 


Val 


He 


Glu 


Arg 


Thr 


Phe 


His 












325 










330 










335 






Val 


Glu 


Thr 


Pro 


Glu 


Glu 


Arg 


Glu 


Glu 


Trp 


Thr 


Thr 


Ala 


He 


Gin 


Thr 










340 










345 










350 






50 


Val 


Ala 


Asp 


Gly 


Leu 


Lys 


Lys 


Gin 


Glu 


Glu 


Glu 


Glu 


Met 


Asp 


Phe 


Arg 








355 










360 










365 










Ser 


Gly 


Ser 


Pro 


Ser 


Asp 


Asn 


Ser 


Gly 


Ala 


Glu 


Glu 


Met 


Glu 


Val 


Ser 






370 










375 










380 












Leu 


Ala 


Lys 


Pro 


Lys 


His 


Arg 


Val 


Thr 


Met 


Asn 


Glu 


Phe 


Glu 


Tyr 


Leu 


55 


385 










390 










395 










400 




Lys 


Leu 


Leu 


Gly 


Lys 


Gly 


Thr 


Phe 


Gly 


Lys 


Val 


He 


Leu 


Val 


Lys 


Glu 
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405 410 
Lys Ala Thr Gly Arg Tyr Tyr Ala Met Lys He Leu Lys Lys Glu Val 

420 425 430 

He Val Ala Lys Asp Glu Val Ala His Thr Leu Thr Glu Asn Arg Val 

435 440 445 

Leu Gin Asn Ser Arg His Pro Phe Leu Thr Ala Leu Lys Tyr Ser Phe 

450 455 460 

Gin Thr His Asp Arg Leu Cys Phe Val Met Glu Tyr Ala Asn Gly Gly 
465 470 475 480 

Glu Leu Phe Phe His Leu Ser Arg Glu Arg Val Phe Ser Glu Asp Arg 

485 490 495 

Ala Arg Phe Tyr Gly Ala Glu He Val Ser Ala Leu Asp Tyr Leu His 

500 505 510 

Ser Glu Lys Asn Val Val Tyr Arg Asp Leu Lys Leu Glu Asn Leu Met 

515 520 525 

Leu Asp Lys Asp Gly His He Lys He Thr Asp Phe Gly Leu cys Lys 

530 535 540 

Glu Gly He Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro 

550 555 560 

Glu Tyr Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala 

565 570 575 

Val Asp Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly 

580 585 590 

Arg Leu Pro Phe Tyr Asn Gin Asp His Glu Lys Leu Phe Glu Leu He 
25 595 600 605 

Leu Met Glu Glu He Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys 

615 620 
Ser Leu Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gin Arg Leu Gly 

630 635 640 

Gly Gly Ser Glu Asp Ala Lys Glu He Met Gin His Arg Phe Phe Ala 

645 650 655 

Gly He Val Trp Gin His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe 

660 665 670 

Lys Pro Gin Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu 
35 675 680 685 

Phe Thr Ala Gin Met He Thr He Thr Pro Pro Asp Gin Asp Asp Ser 

690 695 700 

Met Glu Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gin Phe Ser 
■^^^ 710 715 720 

40 Tyr Ser Ala Ser Ser Thr Ala 

725 

(2) INFORMATION FOR SEQ ID NO: 140: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



50 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
55 (B) LOCATION; 1...2391 

(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 140: 

ATG GAG GAA CTG TTC CCC CTC ATC TTC CCG GCA GAG CCA GCC CAG GCC 4 8 

5 Met Asp Glu Leu Phe Pro Leu lie Phe Pro Ala Glu Pro Ala Gin Ala 
15 10 15 

TCT GGC CCC TAT GTG GAG ATC ATT GAG CAG CCC AAG CAG CGG GGC ATG 96 
Ser Gly Pro Tyr Val Glu He lie Glu Gin Pro Lys Gin Arg Gly Met 
10 20 25 30 

CGC TTC CGC TAG AAG TGC GAG GGG CGC TCC GCG GGC AGC ATC CCA GGC 144 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser He Pro Gly 
35 40 45 

15 

GAG AGG AGC AC A GAT ACC ACC AAG ACC CAC CCC ACC ATC AAG ATC AAT 192 

Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pro Thr He Lys He Asn 
50 55 60 

20 GGC TAC ACA GGA CCA GGG ACA GTG CGC ATC TCC CTG GTC ACC AAG GAC 240 
Gly Tyr Thr Gly Pro Gly Thr Val Arg He Ser Leu Val Thr Lys Asp 
65 70 75 80 

CCT CCT CAC CGG CCT CAC CCC CAC GAG CTT GTA GGA AAG GAC TGC CGG 288 
25 Pro Pro His Arg Pro His Pro His Glu Leu Val Gly Lys Asp Cys Arg 

85 90 95 

GAT GGC TTC TAT GAG GCT GAG CTC TGC CCG GAC CGC TGC ATC CAC AGT 336 
Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro Asp Arg Cys He His Ser 
30 100 105 110 

TTC CAG AAC CTG GGA ATC CAG TGT GTG AAG AAG CGG GAC CTG GAG CAG 384 

Phe Gin Asn Leu Gly He Gin Cys Val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 

35 

GCT ATC AGT CAG CGC ATC CAG ACC AAC AAC AAC CCC TTC CAA GTT CCT 432 

Ala He Ser Gin Arg He Gin Thr Asn Asn Asn Pro Phe Gin Val Pro 
130 135 140 

40 ATA GAA GAG CAG CGT GGG GAC TAC GAC CTG AAT GCT GTG CGG CTC TGC 480 
He Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu Cys 
145 150 155 160 

TTC CAG GTG ACA GTG CGG GAC CCA TCA GGC AGG CCC CTC CGC CTG CCG 528 
45 Phe Gin Val Thr Val Arg Asp Pro Ser Gly Arg Pro Leu Arg Leu Pro 

165 170 175 

CCT GTC CTT CCT CAT CCC ATC TTT GAC AAT CGT GCC CCC AAC ACT GCC 576 
Pro Val Leu Pro His Pro He Phe Asp Asn Arg Ala Pro Asn Thr Ala 
50 180 185 190 

GAG CTC AAG ATC TGC CGA GTG AAC CGA AAC TCT GGC AGC TGC CTC GGT 624 
Glu Leu Lys He Cys Arg Val Asn Arg Asn Ser Gly Ser Cys Leu Gly 
195 200 205 



55 



GGG GAT GAG ATC TTC CTA CTG TGT GAC AAG GTG CAG AAA GAG GAC ATT 672 
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Gly Asp Glu lie Phe Leu Leu Cys Asp Lys Val Gin Lys Glu Asp He 
210 215 220 

GAG GTG TAT TTC ACG GGA CCA GGC TGG GAG GCC CGA GGC TCC TTT TOG 720 
Glu val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe Ser 

230 235 240 

CAA GCT GAT GTG CAC CGA CAA GTG GCC ATT GTG TTC CGG ACC CCT CCC 768 
Gin Ala Asp Val His Arg Gin Val Ala He Val Phe Arg Thr Pro Pro 
245 250 255 

TAC GCA GAC CCC AGC CTG CAG GCT CCT GTG CGT GTC TCC ATG CAG CTG 816 
Tyr Ala Asp Pro Ser Leu Gin Ala Pro Val Arg Val Ser Met Gin Leu 
260 265 270 

CGG CGG CCT TCC GAC CGG GAG CTC AGT GAG CCC ATG GAA TTC CAG TAC 864 
Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu Pro Met Glu Phe Gin Tyr 
275 280 285 

CTG CCA GAT ACA GAC GAT CGT CAC CGG ATT GAG GAG AAA CGT AAA AGG 912 
Leu Pro Asp Thr Asp Asp Arg His Arg He Glu Glu Lys Arg Lys Arq 
290 295 300 



960 



1008 



ACA TAT GAG ACC TTC AAG AGC ATC ATG AAG AAG AGT CCT TTC AGC GGA 
Thr Tyr Glu Thr Phe Lys Ser He Met Lys Lys Ser Pro Phe Ser Glv 
310 315 320 

CCC ACC GAC CCC CGG CCT CCA CCT CGA CGC ATT GCT GTG CCT TCC CGC 
Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg He Ala Val Pro Ser Arg 
325 330 335 

AGC TCA GCT TCT GTC CCC AAG CCA GCA CCC CAG CCC TAT CCC TTT ACG 1056 
Ser Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr 
340 345 350 

TCA TCC CTG AGC ACC ATC AAC TAT GAT GAG TTT CCC ACC ATG GTG TTT 1104 
Ser Ser Leu Ser Thr He Asn Tyr Asp Glu Phe Pro Thr Met Val Phe 
355 360 365 

CCT TCT GGG CAG ATC AGC CAG GCC TCG GCC TTG GCC CCG GCC CCT CCC 
Pro Ser Gly Gin He Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro 
370 375 380 



1152 



CAA GTC CTG CCC CAG GCT CCA GCC CCT GCC CCT GCT CCA GCC ATG GTA 1200 
Gin Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val 

390 395 

TCA GCT CTG GCC CAG GCC CCA GCC CCT GTC CCA GTC CTA GCC CCA GGC 124 8 
Ser Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly 
405 410 415 

CCT CCT CAG GCT GTG GCC CCA CCT GCC CCC AAG CCC ACC CAG GCT GGG 1296 
Pro Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly 
420 425 430 

GAA GGA ACG CTG TCA GAG GCC CTG CTG CAG CTG CAG TTT GAT GAT GAA 



1344 
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Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu 
435 440 445 

GAC CTG GGG GCC TTG CTT GGC AAC AGO ACA GAC CCA GCT GTG TTC ACA 1392 
5 Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr 
450 455 460 

GAC CTG GCA TCC GTC GAC AAC TCC GAG TTT CAG CAG CTG CTG AAC CAG 144 0 

Asp Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin 
10 465 470 475 480 

GGC ATA OCT GTG GCC CCC CAC ACA ACT GAG CCC ATG CTG ATG GAG TAG 1488 
Gly lie Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr 
485 490 495 

15 

CCT GAG GCT ATA ACT CGC CTA GTG ACA GGG GCC CAG AGG CCC CCC GAC 1536 
Pro Glu Ala He Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp 
500 505 510 

20 CCA GCT CCT GCT CCA CTG GGG GCC CCG GGG CTC CCC AAT GGC CTC CTT 1584 
Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 
515 520 525 

TCA GGA GAT GAA GAC TTC TCC TCC ATT GCG GAC ATG GAC TTC TCA GCC 1632 
25 Ser Gly Asp Glu Asp Phe Ser Ser He Ala Asp Met Asp Phe Ser Ala 
530 535 540 

CTG CTG AGT CAG ATC AGC TCC TTG GAT CCA CCG GTC GCC ACC ATG GTG 1680 
Leu Leu Ser Gin He Ser Ser Leu Asp Pro Pro Val Ala Thr Met Val 
30 545 550 555 560 

AGC 7UVG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG 172 8 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 
565 570 575 

35 

CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC 1776 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

580 585 590 

40 GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC 1824 
Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 
595 600 605 

ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC 1872 
45 Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
610 615 620 

TAC GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG CAG CAC 1920 
Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
50 625 630 635 640 

GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC 1968 
Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
645 650 655 



55 



ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG 2 016 
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lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

665 670 

TTC GAG GGC GAG ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC 2064 
Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp 
^75 680 685 

TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC 2112 
Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 

695 700 

AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC 2160 
Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 
''^^ 710 715 720 

AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG 2208 
Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 
725 730 735 

CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG 2256 
Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 
740 745 750 

CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA 2304 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
755 760 765 

GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC 2352 
Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
770 775 780 

GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 23 94 

Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 797 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Met Asp Glu Leu Phe Pro Leu He Phe Pro Ala Glu Pro Ala Gin Ala 

Ser Gly Pro Tyr Val Glu He He Glu Gin Pro Lys Gin Arg Gly Met 

20 25 30 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser He Pro Gly 

35 40 45 

Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pro Thr He Lys He Asn 
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50 55 60 





Gly 


Tyr 


Thr 


Gly 


Pro 


Gly 


Thr 


val 


Arg 


He 


Ser 


Leu 


Val 


Thr 


Lys 


Asp 




65 










70 










75 










80 




Pro 


Pro 


His 


Arg 


Pro 


His 


Pro 


His 


Glu 


Leu 


Val 


Gly 


Lys 


Asp 


Cys 


Arg 


c 
O 










85 










90 










95 






Asp 


Gly 


Pne 


Tyr 


Glu 


A J. a 


Glu 


Leu 


Cys 


Pro 


Asp 


Arg 


Cys 


He 


His 


Ser 










100 










105 










110 








File 


Vain 


Asn 


Leu 


VaXy 


T 1 ^» 

JL j.e 


Gin Cys 


vai 


Lys 


Lys 


Arg 


Asp 


Leu 


oXU 


V7in 








115 










120 










125 








in 


Ala 


lie 


Ser 


Gin 


Arg 


lie 


Gin 


Thr 


Asn 


Asn 


Asn 


Pro 


Pne 


Gin 


val 


Pro 






130 










135 










140 












He 


Glu 


Glu 


Gin 


Arg 


Gly 


Asp 


Tyr 


Asp 


Leu 


Asn 


Ala 


Val 


Arg 


Leu 


Cys 




145 










150 










155 










160 




Phe 


Gin 


Val 


Thr 


Val 


Arg 


Asp 


Pro 


Ser 


Gly Arg 


Pro 


Leu 


Arg 


Leu 


Pro 


1 5 










165 










170 










175 






Pro 


Val 


Leu 


Pro 


His 


Pro 


He 


Phe 


Asp 


Asn 


Arg 


Ala 


Pro 


Asn 


Thr 


Ala 










180 










185 










190 








Glu 


Leu 


Lys 


Tin 

lie 


Cys 


Arg 


Val 


Asn 


Arg 


Asn 


Ser 


Gly 


Ser 


Cys 


Leu 


Gly 








195 










200 










205 








20 


Gly 


Asp 


Glu 


He 


Phe 


Leu 


Leu 


Cys 


Asp 


Lys 


Val 


Gin 


Lys 


Glu 


Asp 


He 






210 










215 










220 












Glu 


Val 


Tyr 


Phe 


Thr 


Gly 


Pro 


Gly 


Trp 


Glu 


Ala 


Arg 


Gly 


Ser 


Phe 


Ser 




225 










230 










235 










240 




Gin 


Ala 


Asp 


Val 


His 


Arg 


Gin 


Val 


Ala 


He 


Val 


Phe 


Arg 


Thr 


Pro 


Pro 


25 










245 










250 










255 






Tyr 


Ala 


Asp 


Pro 


Ser 


Leu 


Gin 


Ala 


Pro 


Val 


Arg 


Val 


Ser 


Met 


Gin 


Leu 










260 










265 










270 








Arg 


Arg 


Pro 


Ser 


Asp 


Arg 


Glu 


Leu 


Ser 


Glu 


Pro 


Met 


Glu 


Phe 


Gin 


Tyr 








275 










280 










285 








30 


Leu 


Pro 


Asp 


Thr 


Asp 


Asp 


Arg 


His 


Arg 


He 


Glu 


Glu 


Lys 


Arg 


Lys 


Arg 






290 










295 










300 












Thr 


Tyr 


Glu 


Thr 


Phe 


Lys 


Ser 


He 


Met 


Lys 


Lys 


Ser 


Pro 


Phe 


Ser 


Gly 




305 










310 










315 










320 




Pro 


Thr 


Asp 


Pro 


Arg 


Pro 


Pro 


Pro 


Arg 


Arg 


He 


Ala 


Val 


Pro 


Ser 


Arg 


35 










325 










330 










335 






Ser 


Ser 


Ala 


Ser 


Val 


Pro 


Lys 


Pro 


Ala 


Pro 


Gin 


Pro 


Tyr 


Pro 


Phe 


Thr 










340 










345 










350 








Ser 


Ser 


Leu 


Ser 


Thr 


He 


Asn 


Tyr 


Asp 


Glu 


Phe 


Pro 


Thr 


Met 


Val 


Phe 








355 










360 










365 








40 


Pro 


Ser 


Gly 


Gin 


He 


Ser 


Gin 


Ala 


Ser 


Ala 


Leu 


Ala 


Pro 


Ala 


Pro 


Pro 






370 










375 










380 












Gin 


Val 


Leu 


Pro 


Gin 


Ala 


Pro 


Ala 


Pro 


Ala 


Pro 


Ala 


Pro 


Ala 


Met 


Val 




385 










390 










395 










400 




Ser 


Ala 


Leu 


Ala 


Gin 


Ala 


Pro 


Ala 


Pro 


Val 


Pro 


Val 


Leu 


Ala 


Pro 


Gly 


45 










405 










410 










415 






Pro 


Pro 


Gin 


Ala 


Val 


Ala 


Pro 


Pro 


Ala 


Pro 


Lys 


Pro 


Thr 


Gin 


Ala 


Gly 










420 










425 










430 








Glu 


Gly 


Thr 


Leu 


Ser 


Glu 


Ala 


Leu 


Leu 


Gin 


Leu 


Gin 


Phe 


Asp 


Asp 


Glu 








435 










440 










445 








50 


Asp 


Leu 


Gly 


Ala 


Leu 


Leu 


Gly Asn 


Ser 


Thr 


Asp 


Pro 


Ala 


Val 


Phe 


Thr 






450 










455 










460 












Asp 


Leu 


Ala 


Ser 


Val 


Asp 


Asn 


Ser 


Glu 


Phe 


Gin 


Gin 


Leu 


Leu 


Asn 


Gin 




465 










470 










475 










480 




Gly 


He 


Pro 


Val 


Ala 


Pro 


His 


Thr 


Thr 


Glu 


Pro 


Met 


Leu 


Met 


Glu 


Tyr 


55 










485 










4 90 










4 95 






Pro 


Glu 


Ala 


He 


Thr 


Arg 


Leu 


Val 


Thr 


Gly Ala 


Gin 


Arg 


Pro 


Pro 


Asp 
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505 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 

515 520 525 

Ser Gly Asp Glu Asp Phe Ser Ser He Ala Asp Met Asp Phe Ser Ala 

530 535 540 

Leu Leu Ser Gin He Ser Ser Leu Asp Pro Pro Val Ala Thr Met Val 
550 555 560 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 

565 570 575 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

580 585 590 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 

600 go5 
Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 

^^0 615 620 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 

630 635 640 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 

645 650 655 

He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

660 665 670 

Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 

675 680 685 

Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 

650 695 700 

Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 

710 715 720 

Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 

725 730 735 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 

740 745 750 

Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

'755 760 765 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 

, '7'^^ 775 780 

Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 ^ ^ 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2391 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 2: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 
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Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
5 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
10 35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

15 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

20 CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
25 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
30 115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

35 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 

145 150 155 160 

40 GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
45 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
160 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
50 195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



55 



GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
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Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
230 235 240 

GGA CTC AGA TCT CGA GCC ATG GAC GAA CTG TTC CCC CTC ATC TTC CCG 768 
5 Gly Leu Arg Ser Arg Ala Met Asp Glu Leu Phe Pro Leu He Phe Pro 

245 250 255 

GCA GAG CCA GCC CAG GCC TCT GGC CCC TAT GTG GAG ATC ATT GAG CAG 816 
Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu He He Glu Gin 
260 265 270 

CCC AAG CAG CGG GGC ATG CGC TTC CGC TAC AAG TGC GAG GGG CGC TCC 864 
Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser 
275 280 285 

GCG GGC AGC ATC CCA GGC GAG AGG AGC ACA GAT ACC ACC AAG ACC CAC 912 
Ala Gly Ser He Pro Gly Glu Arg Ser Thr Asp Thr Thr Lys Thr His 
290 295 300 

CCC ACC ATC AAG ATC AAT GGC TAC ACA GGA CCA GGG ACA GTG CGC ATC 960 
Pro Thr He Lys He Asn Gly Tyr Thr Gly Pro Gly Thr Val Arg He 
310 315 320 

TCC CTG GTC ACC AAG GAC CCT CCT CAC CGG CCT CAC CCC CAC GAG CTT 1008 
Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro His Glu Leu 
325 330 



15 



20 



25 



35 



40 



45 



55 



GTA GGA AAG GAC TGC CGG GAT GGC TTC TAT GAG GCT GAG CTC TGC CCG 1056 

P*^e Tyr Glu Ala Glu Leu 
345 350 



Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro 
30 340 



GAC CGC TGC ATC CAC AGT TTC CAG AAC CTG GGA ATC CAG TGT GTG AAG 1104 
Asp Arg Cys He His Ser Phe Gin Asn Leu Gly He Gin Cys Val Lys 
355 360 365 

AAG CGG GAC CTG GAG CAG GCT ATC AGT CAG CGC ATC CAG ACC AAC AAC 1152 
Lys Arg Asp Leu Glu Gin Ala He Ser Gin Arg He Gin Thr Asn Asn 
370 375 380 

AAC CCC TTC CAA GTT CCT ATA GAA GAG CAG CCT GGG GAC TAC GAC CTG 1200 
Asn Pro Phe Gin Val Pro He Glu Glu Gin Arg Gly Asp Tyr Asp Leu 
390 395 

AAT GCT GTG CGG CTC TGC TTC CAG GTG ACA GTG CGG GAC CCA TCA GGC 1248 
Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp Pro Ser Gly 
405 415 



AGG CCC CTC CGC CTG CCG CCT GTC CTT CCT CAT CCC ATC TTT GAC AAT 1296 
Arg Pro Leu Arg Leu Pro Pro Val Leu Pro His Pro He Phe Asp Asn 
420 425 430 

CGT GCC CCC AAC ACT GCC GAG CTC AAG ATC TGC CGA GTG AAC CGA AAC 1344 
Arg Ala Pro Asn Thr Ala Glu Leu Lys He Cys Arg Val Asn Arg Asn 
435 440 445 



TCT GGC AGC TGC CTC GGT GGG GAT GAG ATC TTC CTA CTG TGT GAC AAG 1392 
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Ser Gly Ser Cys Leu Gly Gly Asp Glu lie Phe Leu Leu Cys Asp Lys 
450 455 460 



GTG CAG AAA GAG GAG ATT GAG GTG 
5 Val Gin Lys Glu Asp lie Glu Val 
465 470 

GCC CGA GGC TCC TTT TCG CAA GCT 
Ala Arg Gly Ser Phe Ser Gin Ala 
10 485 



TAT TTC ACG GGA CCA GGC TGG GAG 1440 
Tyr Phe Thr Gly Pro Gly Trp Glu 
475 480 

GAT GTG CAC CGA CAA GTG GCC ATT 148 8 

Asp Val His Arg Gin Val Ala lie 
490 495 



GTG TTC CGG ACC CCT CCC TAC GCA GAC CCC AGC CTG CAG GCT CCT GTG 153 6 

Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin Ala Pro Val 
500 505 510 

15 

CGT GTC TCC ATG CAG CTG CGG CGG CCT TCC GAC CGG GAG CTC AGT GAG 1584 
Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu 
515 520 525 

20 CCC ATG GAA TTC CAG TAC CTG CCA GAT ACA GAC GAT CGT CAC CGG ATT 1632 
Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg His Arg He 
530 535 540 

GAG GAG AAA CGT AAA AGG ACA TAT GAG ACC TTC AAG AGC ATC ATG AAG 1680 
25 Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser He Met Lys 
545 550 555 560 

AAG AGT CCT TTC AGC GGA CCC ACC GAC CCC CGG CCT CCA CCT CGA CGC 172 8 

Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg 
30 565 570 575 



ATT GCT GTG CCT TCC CGC AGC TCA 
He Ala Val Pro Ser Arg Ser Ser 
580 

35 

CAG CCC TAT CCC TTT ACG TCA TCC 
Gin Pro Tyr Pro Phe Thr Ser Ser 
595 600 



GCT TCT GTC CCC AAG CCA GCA CCC 1776 
Ala Ser Val Pro Lys Pro Ala Pro 
585 590 

CTG AGC ACC ATC AAC TAT GAT GAG 1824 
Leu Ser Thr lie Asn Tyr Asp Glu 
605 



40 TTT CCC ACC ATG GTG TTT CCT TCT GGG CAG ATC AGC CAG GCC TCG GCC 1872 

Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin Ala Ser Ala 
610 615 620 

TTG GCC CCG GCC CCT CCC CAA GTC CTG CCC CAG GCT CCA GCC CCT GCC 1920 
45 Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro Ala 
625 630 635 640 

CCT GCT CCA GCC ATG GTA TCA GCT CTG GCC CAG GCC CCA GCC CCT GTC 1968 
Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro Ala Pro Val 
50 645 650 655 



CCA GTC CTA GCC CCA GGC CCT CCT CAG GCT GTG GCC CCA CCT GCC CCC 2016 
Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala Pro 
660 665 670 

55 

AAG CCC ACC CAG GCT GGG GAA GGA ACG CTG TCA GAG GCC CTG CTG CAG 2 064 



297 
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Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin 
675 680 685 

CTG CAG TTT GAT GAT GAA GAC CTG GGG GCC TTG CTT GGC AAC AGO ACA 2112 
Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr 
690 695 700 

GAC CCA GCT GTG TTC ACA GAC CTG GCA TCC GTC GAC AAC TCC GAG TTT 2160 
Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu Phe 
705 710 715 720 

CAG CAG CTG CTG AAC CAG GGC ATA CCT GTG GCC CCC CAC ACA ACT GAG 2208 
Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His Thr Thr Glu 
725 730 735 

CCC ATG CTG ATG GAG TAG CCT GAG GCT ATA ACT CGC CTA GTG ACA GGG 2256 
Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu Val Thr Gly 
740 745 750 

GCC CAG AGG CCC CCC GAC CCA GCT CCT GCT CCA CTG GGG GCC CCG GGG 2304 
Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly 
755 760 765 

CTC CCC AAT GGC CTC CTT TCA GGA GAT GAA GAC TTC TCC TCC ATT GCG 2352 
Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser He Ala 
770 775 780 

GAC ATG GAC TTC TCA GCC CTG CTG AGT CAG ATC AGC TCC TAA 23 94 

Asp Met Asp Phe Ser Ala Leu Leu Ser Gin lie Ser Ser 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 



Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly Val 


Val 


Pro 


He 


1 








5 










10 










15 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser 








20 










25 








30 




Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 






35 










40 










45 




Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 




50 










55 










60 








Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


65 










70 










75 








Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly Tyr 


Val 


Gin 



80 
Glu 
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85 










90 










95 






Arg 


Thr 


lie 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 


Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 










100 










105 










110 








Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 


5 






115 










120 










125 










lie 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 






130 










135 










140 












Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 




145 










150 










155 










160 


10 


Gly 


lie 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 












165 










170 










175 






Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 










180 










185 










190 








Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 


15 






195 










200 










205 










Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 






210 










215 










220 












Val 


Thr 


Ala 


Ala 


Gly 


lie 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 




225 










230 










235 










240 


20 


Gly 


Leu 


Arg 


Ser 


Arg 


Ala 


Met 


Asp 


Glu 


Leu 


Phe 


Pro 


Leu 


He 


Phe 


Pro 












245 










250 










255 






Ala 


Glu 


Pro 


Ala 


Gin 


Ala 


Ser Gly 


Pro 


Tyr 


val 


Glu 


He 


He 


Glu 


Gin 










260 










265 










270 








Pro 


Lys 


Gin 


Arg 


Gly 


Met 


Arg 


Phe 


Arg 


Tyr 


Lys 


Cys 


Glu Gly 


Arg 


Ser 


25 






275 










280 










285 










Ala 


Gly 


Ser 


lie 


Pro 


Gly 


Glu 


Arg 


Ser 


Thr 


Asp 


Thr 


Thr 


Lys 


Thr 


His 






290 










295 










300 












Pro 


Thr 


lie 


Lys 


lie 


Asn 


Gly Tyr Thr 


Gly 


Pro Gly 


Thr 


Val 


Arg 


He 




305 










310 










315 










320 


30 


Ser 


Leu 


Val 


Thr 


Lys 


Asp 


Pro 


Pro 


His 


Arg 


Pro 


His 


Pro 


His 


Glu 


Leu 












325 










330 










335 






Val 


Gly 


Lys 


Asp 


Cys 


Arg 


Asp 


Gly 


Phe 


Tyr 


Glu 


Ala 


Glu 


Leu 


Cys 


Pro 










340 










345 










350 








Asp 


Arg 


Cys 


lie 


His 


Ser 


Phe 


Gin 


Asn 


Leu 


Gly 


He 


Gin 


Cys 


Val 


Lys 


35 






355 










360 










365 










Lys 


Arg 


Asp 


Leu 


Glu 


Gin 


Ala 


He 


Ser 


Gin 


Arg 


He 


Gin 


Thr 


Asn 


Asn 






370 










375 










380 












Asn 


Pro 


Phe 


Gin 


Val 


Pro 


He 


Glu 


Glu 


Gin Arg 


Gly 


Asp 


Tyr 


Asp 


Leu 




385 










390 










395 










400 


40 


Asn 


Ala 


Val 


Arg 


Leu 


Cys 


Phe 


Gin 


Val 


Thr 


val 


Arg 


Asp 


Pro 


Ser 


Gly 












405 










410 










415 






Arg 


Pro 


Leu 


Arg 


Leu 


Pro 


Pro 


Val 


Leu 


Pro 


His 


Pro 


He 


Phe 


Asp 


Asn 










420 










425 










430 








Arg 


Ala 


Pro 


Asn 


Thr 


Ala 


Glu 


Leu 


Lys 


He 


Cys 


Arg 


Val 


Asn 


Arg 


Asn 


45 






435 










440 










445 










Ser 


Gly 


Ser 


Cys 


Leu 


Gly Gly Asp Glu 


He 


Phe 


Leu 


Leu 


Cys 


Asp 


Lys 






450 










455 










460 












Val 


Gin 


Lys 


Glu 


Asp 


He 


Glu 


Val 


Tyr 


Phe 


Thr 


Gly 


Pro 


Gly 


Trp 


Glu 




465 










470 










475 










480 


50 


Ala 


Arg 


Gly 


Ser 


Phe 


Ser 


Gin 


Ala 


Asp 


Val 


His 


Arg 


Gin 


Val 


Ala 


He 












485 










490 










495 






Val 


Phe 


Arg 


Thr 


Pro 


Pro 


Tyr 


Ala 


Asp 


Pro 


Ser 


Leu 


Gin 


Ala 


Pro 


Val 










500 










505 










510 








Arg 


Val 


Ser 


Met 


Gin 


Leu 


Arg Arg 


Pro 


Ser 


Asp 


Arg 


Glu 


Leu 


Ser 


Glu 


55 






515 










520 










525 










Pro 


Met 


Glu 


Phe 


Gin 


Tyr 


Leu 


Pro 


Asp 


Thr 


Asp 


Asp 


Arg 


His 


Arg 


He 
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530 535 540 

Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser lie Met Lys 
545 550 555 560 

Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg 

565 570 575 

He Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys Pro Ala Pro 

580 585 590 

Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr He Asn Tyr Asp Glu 

595 600 605 

Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin Ala Ser Ala 

615 620 
Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro Ala 
625 630 635 640 

Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro Ala Pro Val 

645 650 655 

Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala Pro 

660 665 670 

Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin 

675 680 685 

Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr 

690 695 700 

Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu Phe 
705 710 715 720 

Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His Thr Thr Glu 

725 730 735 

Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu Val Thr Gly 

740 745 750 

Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly 

755 760 765 

Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser He Ala 

770 775 780 

Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser 
785 790 795 
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CLAIMS 



10 



15 



20 



25 



1. A method for extracting quantitative information relating to an influence on a cellular re- 
sponse, the method comprising recording variation, caused by the influence on a mechani- 
cally intact living cell or mechanically intact living cells, in spatially distributed light emitted 
from a luminophore, the luminophore being present in the cell or cells and being capable of 
being redistributed in a manner which is related with the degree of the Influence, and/or of 
being modulated by a component which is capable of being redistributed in a manner which 
is related to the degree of the influence, the association resulting in a modulation of the lu- 
minescence characteristics of the luminophore, and processing the recorded variation in the 
spatially distributed light to provide quantitative information correlating the spatial distribution 
to the degree of the influence on the cellular response. 

2. A method according to claim 1 , as used for extracting quantitative information relating to 
an influence on an intracellular pathway involving redistribution of at least one component 
associated with the pathway, or part thereof, the method comprising recording the result of 
the influence on mechanically intact living cell or cells, as manifested in spatially distributed 
light emitted from a luminophore which is present in the cell or cells and which is capable of 
being redistributed, by modulation of the pathway, in a manner which is related to the redis- 
tribution of the at least one component of the intracellular pathway, processing the recorded 
result to provide quantitative information about the spatially distributed light and correlating 
the quantitative infonmation to the degree of the influence on the intracellular pathway. 

3. A method according to claim 1 or 2, wherein the quantitative Information which is indica- 
tive of the degree of the cellular response to the influence or the result of the influence on 
the intracellular pathway is extracted from the recording or recordings according to a prede- 
termined calibration based on responses or results, recorded In the same manner, to known 
degrees of a relevant specific influence. 

4. A method according to any of the preceding claims, wherein the influence is contact be- 
tween the mechanically intact living cell or the group of mechanically intact living cells with a 
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Chemical substance and/or incubation of the meclianically intact living cell or the group of 
mechanically intact living cells with a chemical substance. 



5. A method according to claim 4 wherein the substance is a substance whose effect on an 
intracellular pathway is to be determined. 



6. A method according to any of the preceding claims, wherein the recording is made at a 
single point in time after the application of the influence. 



7. A method according to any of claims 1-5. wherein the recording is made at two points in 
time, one point being before, and the other point being after the application of the influence. 

8. A method according to any of claims 1-5, wherein the recording is performed at a series of 
points in time, in which the application of the influence occurs at some time after the first 
time point in the series of recordings, the recording being performed, e.g.. with a predeter- 
mined time spacing of from 0.1 seconds to 1 hour, preferably from 1 to 60 seconds, more 
preferably from 1 to 30 seconds, in particular from 1 to 10 seconds, over a time span of from 
1 second to 12 hours, such as from 10 seconds to 12 hours, e.g.. from 10 seconds to one 
hour, such as from 60 seconds to 30 minutes or 20 minutes. 



9. A method according to any of claims 1-7. wherein the cell or cells is/are fixed at a point in 
time after the application of the influence at which the response has been predetermined to 
be significant, and the recording is made at an arbitrary later time. 



10. A method according to any of the preceding claims, wherein the luminophore is a lu- 
minophore which is capable of being redistributed in a manner which is physiologically rele- 
vant to the degree of the influence. 
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1 1 . A method according to any of the preceding claims, wherein the luminophore Is a lu- 
mlnophore which is capable of associating with a component which is capable of being re- 
distributed in manner which is physiologically relevant to the degree of the influence. 



5 12. A method according to any of the preceding claims, wherein the luminophore Is a lu- 
minophore which is capable of being redistributed in a manner which is experimentally de- 
termined to be correlated to the degree of the influence. 

13. A method according to any of the preceding claims, wherein the luminophore is a lu- 
10 minophore which Is capable of being redistributed, by modulation of the Intracellular path- 
way, in substantially the same manner as the at least one component of the intracellular 
pathway. 

14. A method according to any of claims 1-13. wherein the luminophore is a luminophore 
15 which is capable of being quenched upon spatial association with a component which is re- 
distributed by modulation of the pathway, the quenching being measured as a decrease in 
the intensity of the luminescence. 

15. A method according to any of claims 1-13. wherein the variation or result with respect to 
20 the spatially distributed light emitted by the luminophore is detected by a change in the reso- 
nance energy transfer between the luminophore and another luminescent entity capable of 
delivering energy to the luminophore, each of which has been selected or engineered to be- 
come part of, bound to or associated with particular components of the intracellular pathway, 
and one of which undergoes redistribution in response to the influence, thereby changing the 

25 amount of resonance energy transfer, the change In the resonance energy transfer being 
measured as a change in the intensity of emission from the luminophore. 

16. A method according to claim 15, wherein the change in the Intensity of the emission from 
the luminophore is sensed by a single channel photodetector which responds only to the av- 

30 erage intensity of the luminophore in a non-spatially resolved fashion 
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17. A method according to any of claims 1-16. wherein the property of the light being re- 
corded is intensity, fluorescence lifetime, polarization, wavelength shift, or other property 
which is modulated as a result of the underlying cellular response. 

18. A method according to any of claims 1-15 or 17, wherein the recording of the spatially 
distributed light is performed using a recording system which records the spatial distribution 
of a recordable property of the light in the form of an ordered an-ay of values. 

19. A method according to claim 18, wherein the recording of the spatial distribution of the 
recordable property of the light is performed using a charge transfer device such as a CCD 
array or a vacuum tube device such as a vidicon tube. 

20. A method according to any of the preceding claims, wherein the light to be measured 
passes through a filter which selects the desired component of the light to be measured and 
rejects other components. 

21. A method according to any of the preceding claims, wherein the recording of the spatial 
distribution of the recordable property of light is performed by fluorescence microscopy. 

22. A method according to any of the preceding claims, wherein the recording of the varia- 
tion or result with respect to light emitted from the luminophore is performed by recording the 
spatially distributed light as one or more digital images, and the processing of the recorded 
variation to reduce It to one or more numbers representative of the degree of redistribution 
comprises a digital image processing procedure or combination of digital image processing 
procedures. 

23. A method according to any of claims 2-22. wherein the intracellular pathway is an intra- 
cellular signalling pathway. 
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24. A method according to any of the preceding claims, wherein the luminophore is a fluoro- 
phore. 

25. A method according to any of the preceding claims wherein the luminophore is a 
polypeptide encoded by and expressed from a nucleotide sequence harboured in the cell or 
cells. 

26. A method according to any of the preceding claims, wherein the luminophore is a hybrid 
polypeptide comprising a fusion of at least a portion of each of two polypeptides one of which 
comprises a luminescent polypeptide and the other one of which comprises a biologically 
active polypeptide, as defined herein. 

27. A method according to claim 26. wherein the luminescent polypeptide is a GFP as de- 
fined herein. 

28. A method according to claim 27 wherein the GFP is selected from the group consisting 
of green fluorescent proteins having the F64L mutation as defined herein. 

29. A method according to claim 28 wherein the GFP is a GFP variant selected from the 
group consisting of F64L-GFP. F64L-Y66H-GFP, F64L-S65T-GFP, and EGFP. 

30. A method according to any of the previous claims for detecting intracellular translocation 
of a biologically active polypeptide affecting intracellular processes upon activation, the 
method comprising 

a) cuituring one or more cells containing a nucleotide sequence coding for a hybrid 
polypeptide comprising a GFP which is N- or C-terminally tagged, optionally through 
a linker, to a biologically active polypeptide under conditions permitting expression of 
the nucleotide sequence. 
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b) modulating the activity of the biologically active polypeptide by Incubating the cell or 
cells with a substance having biological activity and 

c) measuring the fluorescence produced by the Incubated cell or cells and determining 
the result or variation with respect to the fluorescence, such result or variation being 
indicative of the translocation of a biologically active polypeptide in said cell. 

31. A method according to claim 30, wherein the nucleotide sequence is a DNA sequence. 

32. A method according to claim 30 or 31, wherein the modulation Is an activation. 

33. A method according to claim 30 or 31, wherein the modulation is a deactivation. 

34. A method according to any of claims 30-33 wherein the fluorescence of the cell or cells is 
measured prior to the modulation, and the result or variation determined in step (c) is a 
change in fluorescence compared to the fluorescence measured prior to the modulation. 

35. A method according to any of claims 30-34. wherein the intracellular processes are Intra- 
cellular signalling pathways. 

36. A method according to claim 34, wherein the change in fluorescence measured in step 
(c) comprises determining a change in the spatial distribution of the fluorescence. 

37. A method according to any of the preceding claims wherein the mechanically Intact living 
cell or cells is/are a mammalian cell/mammalian cells which, during the time peroid over 
which the influence is observed, is/are incubated at a temperature of 30X or above, pref- 
erably at a temperature of from 32X to 39-C, more preferably at a temperature of from 35X 
to 38'C, and most preferably at a temperature of about 37*C. 
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38. A method according to any of the preceding claims, wherein the at least one mechani- 
cally intact living cell is part of a matrix of identical or non-identical cells. 

39. A method according to any of claims 1-36 and 38, wherein the cell or cells is/are se- 
lected from the group consisting of fungal cells, such as a yeast cell; invertebrate cells in- 
cluding insect cells; and vertebrate cells, such as mammalian cells. 

40. A nucleic acid construct coding for a fusion polypeptide comprising a biologically active 
polypeptide that is a component of an intracellular signalling pathway, or a part thereof, and 
a GFP, with the proviso that the construct is not a construct coding for a fusion polypeptide 
in which the biologically active polypeptide is selected from the group consisting of PKC- 
alpha. PKC-gamma. and PKC-epsilon. 

41. A nucleic acid construct coding for a fusion polypeptide comprising a biologically active 
polypeptide that is a component of an intracellular signalling pathway, or a part thereof, and 
an F64L mutant of GFP. 

42. A nucleic acid construct according to claim 40 or 41, wherein the biologically active 
polypeptide is a protein kinase or a phosphatase. 

43. A nucleic add construct according to any of claims 40-42 wherein the GFP is N- or C- 
tenmlnally tagged, optionally via a peptide linker, to the biologically active polypeptide or part 
thereof. 

44. A nucleic acid construct according to any of claims 40, 41 and 43, wherein the biologi- 
cally active polypeptide is a transcription factor or a part thereof which changes cellular lo- 
calisation upon activation. 
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45. A nucleic add construct according to any of claims 40. 41 and 43, wherein the biologi- 
cally active polypeptide is a protein, or a part thereof, which is associated with the cy- 
toskeletal network and which changes cellular localisation upon activation. 



46. A nucleic acid construct according to any of claims 40-43, wherein the biologically active 
polypeptide is a protein kinase or a part thereof which changes cellular localisation upon ac- 
tivation. 



47. A nucleic add construct according to daim 46, wherein the protein kinase is a ser- 
ine/threonine protein kinase or a part thereof capable of changing intracellular localisation 
upon activation. 



48. A nucleic add construct according to daim 46. wherein the protein kinase is a tyrosine 
protein kinase or a part thereof capable of changing intracellular localisation upon activation. 



49. A nudeic acid construct according to daim 46. wherein the protein kinase is a phosphol- 
ipid-dependent serine/threonine protein kinase or a part thereof capable of changing intra- 
cellular localisation upon activation. 



20 50. A nudeic add construct according to daim 46. wherein the protein kinase is a cAMP- 
dependent protein kinase or a part thereof capable of changing cellular localisation upon ac- 
tivation. 



51. A nucleic acid construct according to daim 50 which codes for a PKAc-F64L-S65T-GFP 
25 fusion. 



52. A nucleic acid constmct according to daim 46, wherein the protein kinase is a cGMP- 
dependent protein kinase or a part thereof capable of changing cellular localisation upon ac- 
tivation. 



15 
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53. A nucleic acid construct according to claim 46. wherein the protein kinase is a 
calmodulin-dependent serine/threonine protein kinase or a part thereof capable of changing 
cellular localisation upon activation. 

54. A nucleic acid construct according to claim 46, wherein the protein kinase is a mitogen- 
activated serine/threonine protein kinase or a part thereof capable of changing cellular local- 
isation upon activation. 

55. A nucleic add construct according to claim 54, which codes for an ERK1-F64L-S65T- 
GFP fusion. 

56. A nucleic acid construct according to claim 54. which codes for an EGFP-ERK1 fusion. 

57. A nucleic acid construct according to claim 46. wherein the protein kinase is a cyclin- 
dependent serine/threonine protein kinase or a part thereof capable of changing cellular lo- 
calisation upon activation. 

58. A nucleic acid construct according to claim 42 or 43. wherein the biologically active 
polypeptide is a protein phosphatase or a part thereof capable of changing cellular localisa- 
tion upon activation. 

59. A nucleic acid construct according to any of claims 40-58 which is a DNA construct. 

60. A nucleic acid construct according to any of claims 40-59 wherein the gene encoding 
GFP is derived from Aequorea victoria. 

61 . A nucleic acid construct according to claim 60 in which the gene encoding GFP is the 
gene encoding EGFP as defined herein. 
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62. A nucleic acid construct according to claim 60 in which the gene encoding a GFP is a 
gene encoding a GFP variant selected from F64L-GFP, F64L-Y66H-GFP and F64L-S65T- 
GFP. 

63. A DNA construct according to claim 59 and 61 or. where applicable. 62, which is a con- 
struct as identified by any of the DNA sequences shown in SEQ ID NO: 38. 40, 42. 44. 46, 
48, 50. 52. 54, 56. 58, 60, 62, 64, 66. 68. 70, 72, 74, 76. 78, 108, 110. 112, 114. 116. 118. 
120. 122. 124. 126, 128. 130. 132, 134. 136. 138. 140. and 142. or is a variant thereof ca- 
pable of encoding the same fusion polypeptide or a fusion polypeptide which is biologically 
equivalent thereto, as defined herein. 

64. A cell containing a nucleic acid construct according to any of claims 40-63 and capable 
of expressing the sequence encoded by the construct. 

65. A cell according to claim 64. which is a eukaryotic cell. 

66. A cell according to claim 64. which is selected from the group consisting of fungal cells, 
such as yeast cells; invertebrate cells, including insect cells, and vertebrate cells, such as 
mammalian cells. 

67. A cell according to claim 66. which is a mammalian cell. 

68. An organism carrying in at least one of its component cells a nucleic acid sequence as 
contained in the constructs according to any of claims 40-59. said cell being capable of ex- 
pressing said nucleic acid sequence. 

69. An organism according to claim 68 which is selected from the group consisting of uni- 
cellular and multicellular organisms, such as a mammal. 
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70. A fluorescent probe comprising a GFP which Is N- or C-terminally tagged, optionally via a 
peptide linker, to a biologically active polypeptide or a part or a subunit thereof which Is a 
connponent of a intracellular signalling pathway as defined herein, the probe being a probe 
which is encoded by the nucleic acid construct according to any of claims 40-59. 

71. A method according to any of claims 1-39. wherein the lumlnophore is a fusion polypep- 
tide as encoded by the nucleic acid construct according to any of claims 40-63. 

72. A method according to any of claims 1-39 or 71 In which the method of the invention is 
used in a screening program as defined herein. 

73. An apparatus for measuring the distribution of fluorescence in at least one cell, and 
thereby any change in the distribution of fluorescence in at least one cell, which includes the 
following component parts: (a) a light source, (b) a means for selecting the wavelength(s) of 
light from the source which will excite the fluorescence of the protein, (c) a means for rapidly 
blocking or pass ing the excitation light into the rest of the system, (d) a series of optical 
elements for conveying the excitation light to the specimen, collecting the emitted fluores- 
cence in a spatially resolved fashion, and forming an image from this fluorescence, (e) a 
bench or stand which holds the container of the cells being measured in a predetermined 
geometry with respect to the series of optical elements, (f) a detector to record the spatially 
resolved fluorescence in the form of an image, (g) a computer or electronic system and as- 
sociated software to acquire and store the recorded images, and to compute the degree of 
redistribution from the recorded images. 

74. An apparatus according to claim 73 in which some or all of the system is automated. 

75. An apparatus according to claim 73 in which components d and e comprise a fluores- 
cence microscope. 
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76. An apparatus according to claim 73 in wliich component f is a CCD 



camera. 



77. An apparatus according to claim 73 in which the image is fomied and recorded by an 
optical scanning system. 



78. An apparatus according to claim 73 in which a liquid addition system is used to add a 
known or unknown compound to any or all of the cells in the cell holder at a time detemiined 
in advance. 



79. An apparatus according to claim 78 In which the liquid addition system is under the con- 
trol of the computer or electronic system. 



80. A method according to any of claims 1-79 wherein the method is a screening program for 
the identification of a biologically active substance as defined herein that directly or indirectly 
affects an intracellular signalling pathway and is potentially useful as a medicament, wherein 
the result of the individual measurement of each substance being screened which indicates 
its potential biological activity is based on measurement of the redistribution of spatially re- 
solved luminescence in living cells and which undergoes a change in distribution upon acti- 
vation of an intracellular signalling pathway. 

81 A method according to any of claims 1-79 wherein the method is a screening program for 
the identification of a biologically toxic substance as defined herein that exerts its toxic effect 
by interfering with an intracellular signalling pathway, wherein the result of the individual 
measurement of each substance being screened which indicates Its potential biologically 
toxic activity is based on measurement of the redistribution of said fluorescent probe in living 
cells and which undergoes a change in distribution upon activation of an intracellular signal- 
ling pathway. 



82. A method according to any of claims 1 -80 wherein a fluorescent probe is used in back- 
tracking of signal transduction pathways as defined herein. 
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83. A method of treating a condition or disease related to the intracellular function of a pro- 
tein kinase comprising administering to a patient suffering from said condition or disease an 
effective amount of a compound which has been discovered by any method according to the 

5 invention. 

84. A compound that modulates a component of an Intracellular pathway as defined herein, 
as determined by a method according to the method of the invention. 

10 85. A medical composition comprising a therapeutic amount of a compound identified ac- 
cording the method of the invention. 

86. A method of selectively treating a patient suffering from an ailment which responds to 
medical treatment comprising obtaining a primary cell or cells from said patient, transfecting 

15 the cell or cells with at least one DNA sequence encoding a fluorescent probe according to 
the invention, culturing the ceil or cells under conditions permitting the expression of said 
probes and exposing it to an array of medicaments suspected of being capable of alleviating 
said ailment, then comparing changes in fluorescence pattems or redistribution patterns of 
the fluorescent probes in the intact living cell or cells to detect the cellular response to the 

20 specific medicaments (obtaining a cellular action profile), then selecting a medicament(s) 
based on desired activity and acceptable level of side effects and administering an effective 
amount of said medicament(s) to said patient. 

87. A method according to any of claims 1-80 of identifying a drug target among the group of 
25 biologically active polypeptides which are components of intracellular signalling pathways. 
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Claims Nos.: 85,86 



The subject-matter (compounds per se) is solely characterised in claims 
85 and 86 by the result to be achieved, no support of a technical 
character is derivable from the description for the technical formulation 
of the subject of the search, accordingly no scope of a search could be 
defined and a meaningful! search is hence not possible. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: Partially: 1-43, 46, 59-82 and 88; Entirely: 47. 
49, 53-57 



Methods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically active 
molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fusion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being serine/threonine protein 
kinases 



2. Claims: Partially: 1-41, 43, 59-82 and 88; Entirely: 48 

Methods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically active 
molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fusion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being to tyrosine kinases 



3. Claims: Partially: 1-43, 46, 59-82 and 88; Entirely: 50, 51 



MMethods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically active 
molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fijsion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being to cAMP dependent protein 
kinases. 



4. Claims: Partially: 1-43, 46, 59-82 and 88; Entirely: 52 

MMethods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically active 
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molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fusion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being cGMP dependent protein 
kinases 



5. Claims: Partially: 1-43. 59-82 and 88; Entirely: 58 

Methods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically acti 
molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fusion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being protein phosphatases 



ve 



6. Claims: Partially: 1-41, 43, 59-82 and 88; Entirely: 44 

Methods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically active 
molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fusion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being to transcription factors 

7. Claims: Partially: 1-41, 43, 59-82 and 88; Entirely: 45 

Methods for extracting information from influences on a 
living cell involving observing spatial redistribution or 
modulation of a luminophore linked to a biologically active 
molecule, in particular to a molecule involved in 
intracellular signalling pathways, nucleic acids encoding 
fusion proteins comprising bothe the luminophore and the 
biological active molecule, cells containing and expressing 
these nucleic acids, as well as methods and apparatuses 
involving above products, inso far as related to the 
biologically active protein being to proteins associated 
with the cytoskeletal network 
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